History
Total running cost: $0.0150
Run
Prompt: Are the two responses equivalent? Ignore punctuation and irrelevant characters and differences in verb tense. Reply with true or false. One word all lowercase. Response 1: {label} Response 2: {conclusion}
Rows: 100
Type: text, text
Provider/Model: OpenAI GPT-4o
Target: 7b788db7ccefc855e4515f91722df849
Status: completed
Runtime: 00:00:46
Run: 2 weeks ago
By: ox
Tokens: 5178
Cost: $0.0137

Sample
Prompt: Are the two responses equivalent? Ignore punctuation and irrelevant characters and differences in verb tense. Reply with true or false. One word all lowercase. Response 1: {label} Response 2: {conclusion}
Rows: 5
Type: text, text
Provider/Model: OpenAI GPT-4o
Target: Sample - N/A
Status: completed
Runtime: 00:00:02
Run: 2 weeks ago
By: ox
Tokens: 251
Cost: $0.0007

Sample
Prompt: Are the two responses equivalent? Ignore punctuation and irrelevant characters. Reply with true or false. One word all lowercase. Response 1: {label} Response 2: {conclusion}
Rows: 5
Type: text, text
Provider/Model: OpenAI GPT-4o
Target: Sample - N/A
Status: completed
Runtime: 00:00:02
Run: 2 weeks ago
By: ox
Tokens: 226
Cost: $0.0006

Sample
Prompt: Are the two responses equivalent? Reply with true or false. One word all lowercase. Response 1: {label} Response 2: {conclusion}
Rows: 5
Type: text, text
Provider/Model: OpenAI GPT-4o mini
Target: Sample - N/A
Status: completed
Runtime: 00:00:02
Run: 2 weeks ago
By: ox
Tokens: 196
Cost: $0.0000

Sample
Prompt: Are the two responses equivalent? Reply with true or false. Response 1: {label} Response 2: {conclusion}
Rows: 5
Type: text, text
Provider/Model: OpenAI GPT-4o mini
Target: Sample - N/A
Status: completed
Runtime: 00:00:02
Run: 2 weeks ago
By: ox
Tokens: 173
Cost: $0.0000
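
The prompts in this history carry two placeholders, {label} and {conclusion}, and ask for a one-word true/false reply. The sketch below illustrates one way such a template could be filled per row and the reply normalized. It assumes the placeholders are substituted from row fields of the same names, which the history does not confirm; fill_prompt and parse_verdict are hypothetical helper names, not part of the tool, and no model call is made.

```python
from typing import Optional

# Prompt text copied from the 100-row run in the history above.
PROMPT_TEMPLATE = (
    "Are the two responses equivalent? Ignore punctuation and irrelevant "
    "characters and differences in verb tense. Reply with true or false. "
    "One word all lowercase. Response 1: {label} Response 2: {conclusion}"
)


def fill_prompt(template: str, row: dict) -> str:
    """Substitute the {label} and {conclusion} placeholders for one row.

    Assumption: each row exposes fields named 'label' and 'conclusion'.
    """
    return template.format(label=row["label"], conclusion=row["conclusion"])


def parse_verdict(reply: str) -> Optional[bool]:
    """Map a model reply to True/False, or None if it is neither word.

    Tolerates stray whitespace, capitalization, and trailing punctuation,
    since a model may not follow the one-word instruction exactly.
    """
    word = reply.strip().strip(".!\"'").lower()
    if word == "true":
        return True
    if word == "false":
        return False
    return None


if __name__ == "__main__":
    # Illustrative row; not taken from the evaluated dataset.
    row = {"label": "The patient improved", "conclusion": "the patient improves."}
    print(fill_prompt(PROMPT_TEMPLATE, row))
    print(parse_verdict(" True.\n"))
```

Returning None for anything other than true or false keeps malformed replies visible instead of silently counting them as a mismatch.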