Evaluations/934dcc31-0579-4feb-8b8f-493947a952c3
plan-and-solve
query_result.parquet
text → text
OpenAI/GPT 4o
is_correct
Check if the following answers are equivalent or not. Answer with true or false, one word, all lowercase.

Answer 1: {answer}
Answer 2: {prediction}
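A minimal sketch of how the template above might be filled in and how the one-word reply could be mapped to the is_correct column. The function names `build_prompt` and `parse_is_correct` are hypothetical and not part of the evaluation tool; the model call itself is omitted, and tolerating stray whitespace, capitalization, or a trailing period in the reply is an assumption.

```python
from typing import Optional

# Template text copied from the evaluation definition above.
PROMPT_TEMPLATE = (
    "Check if the following answers are equivalent or not. "
    "Answer with true or false, one word, all lowercase.\n"
    "\n"
    "Answer 1: {answer}\n"
    "Answer 2: {prediction}"
)


def build_prompt(answer: str, prediction: str) -> str:
    """Fill the {answer} and {prediction} placeholders (hypothetical helper)."""
    return PROMPT_TEMPLATE.format(answer=answer, prediction=prediction)


def parse_is_correct(reply: str) -> Optional[bool]:
    """Map the judge's reply to True/False; return None if it is neither.

    Assumption: surrounding whitespace, capitalization, and a trailing
    period are tolerated, since models do not always follow the format.
    """
    word = reply.strip().lower().rstrip(".")
    if word == "true":
        return True
    if word == "false":
        return False
    return None
```

Returning None for an unexpected reply keeps malformed judge output distinguishable from a genuine "false" verdict.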
Oct 18, 2024, 4:44 AM UTC
5 row sample
5 rows processed, 229 tokens used
Sample Results completed
9 columns, 1-5 of 100 rows