Evaluations/Qwen 72B on Boolq Validation
conflict-main-2a556f1d-96da-402a-8bbe-15e1058c8d21
boolq/super_glue_boolq_validation.parquet
text → text
OpenAI OpenAI
openai GPT-4o mini
qwen_prediction_retry
According to the context, answer the question with only 1 or 0. 1 for yes and 0 for no
context:
{passage}
question:
{question}
Dec 7, 2024, 12:02 AM UTC
Dec 7, 2024, 12:02 AM UTC
5 row sample
1061 tokens$ 0.0002
5 rows processed, 1061 tokens used ($0.0002)
Estimated cost for all 3270 rows: $0.1056
Sample Resultscompleted
8 columns, 1-5 of 3270 rows