Evaluations/8ecd8278-e1d2-43d9-9f64-e23ee4a1c8f6
openai-answer-extract
rag_instruct_test.jsonl
text
OpenAI OpenAI
openai GPT-4o mini
is_correct
Are the following two answers equivalent? If the answers contain numeric values, only compare the numbers and not the words. Answer "true" or "false". All lowercase.

Answer 1: {answer}
Answer 2: {prediction}
Nov 8, 2024, 12:39 AM UTC
Nov 8, 2024, 12:39 AM UTC
00:00:01
5 row sample
340 tokens$ 0.0001
5 rows processed, 340 tokens used ($0.0001)
Estimated cost for all 200 rows: $0.0021
Sample Resultscompleted
6 columns, 1-5 of 200 rows