Evaluations/545f86b0-15b4-43e7-be10-322103db0b9d
main
cb/super_glue_cb_validation.parquet
text ā†’ text
OpenAI OpenAI
openai GPT-4o mini
qwen_prediction
Determine if the hypothesis is correct based on the premise.
premise:
{premise}
hypothesis:
{hypothesis}
Dec 9, 2024, 6:48 PM UTC
Dec 9, 2024, 6:48 PM UTC
5 row sample
828 tokens$ 0.0003
5 rows processed, 828 tokens used ($0.0003)
Estimated cost for all 56 rows: $0.0030
Sample Resultscompleted
7 columns, 1-5 of 56 rows