Evaluations/30b3ca62-3c5c-4e57-b033-afc17a1d19c1
main
cb/super_glue_cb_validation.parquet
texttext
OpenAI OpenAI
openai GPT-4o mini
qwen_prediction
Based on the premise,determine whether to agree with the hypothesis. Respond with only 0, 1 or 2. 2 for Support, 0 for Oppose, or 1 for Neutral.
premise:
{premise}
hypothesis:
{hypothesis}
Dec 9, 2024, 7:08 PM UTC
Dec 9, 2024, 7:08 PM UTC
5 row sample
666 tokens$ 0.0001
5 rows processed, 666 tokens used ($0.0001)
Estimated cost for all 56 rows: $0.0011
Sample Resultscompleted
7 columns, 1-5 of 56 rows