Evaluations/8b3b0517-88bd-4084-b612-f9d4fec412b3
openai-answer-extract
rag_instruct_test.jsonl
text
OpenAI OpenAI
openai GPT-4o mini
is_correct
Are the following two answers equivalent? If the answers contain numeric values, only compare the numbers and not the words. Answer "true" or "false".

Answer 1: {answer}
Answer 2: {prediction}
Nov 8, 2024, 12:39 AM UTC
Nov 8, 2024, 12:39 AM UTC
00:00:04
5 row sample
325 tokens$ 0.0001
5 rows processed, 325 tokens used ($0.0001)
Estimated cost for all 200 rows: $0.0020
Sample Resultscompleted
6 columns, 1-5 of 200 rows