Evaluations
Run models against your data
Evaluations let you test and compare a selection of AI models against your own datasets.
Whether you're fine-tuning models or measuring performance metrics, Oxen Evaluations simplifies the process by running a prompt through every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
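The prompts in the runs listed on this page reference dataset columns with curly-brace placeholders such as {section} or {question}. As a rough illustration of what running a prompt through a dataset means, here is a minimal Python sketch that fills such a template once per row. The function name render_prompts and the sample rows are hypothetical and chosen for illustration only; this is not Oxen's implementation or API.

```python
from typing import Dict, Iterable, List


def render_prompts(template: str, rows: Iterable[Dict[str, str]]) -> List[str]:
    """Fill a {column}-style template once per row (hypothetical helper).

    Raises KeyError if the template names a column that a row lacks.
    """
    return [template.format_map(row) for row in rows]


# Template copied from one of the runs listed on this page.
template = "Summarize the following text in a single sentence.\n\n{section}"

# Made-up sample rows; a real dataset would supply these.
rows = [
    {"section": "First example passage."},
    {"section": "Second example passage."},
]

prompts = render_prompts(template, rows)
for prompt in prompts:
    print(prompt)
    print("---")
```

Each rendered string is what would be sent to the selected model for that row; the model's responses would then form the new column of the resulting dataset.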
70ef3b23-b9e1-4e4c-978a-dd0c2ff5553e
Google/Text Embedding 004 (text → embeddings)
Bessie
ox
2 months ago
markdown_text
completed | 5 row sample | 0 tokens | $0.0000 | 1 iteration
59a4ee4a-3162-4ce3-ad3c-f5cfbc8cf2f1
OpenAI/GPT 4o mini (text → text)
Bessie
ox
2 months ago
Summarize the following text in a single sentence.

{section}
completed | 5 row sample | 2096 tokens | $0.0004 | 2 iterations
5f4c7930-4cfd-4245-a25e-39ea7db5d329
OpenAI/Text Embedding 3 - Small (text → embeddings)
Bessie
ox
2 months ago
answer_relevance_questions
completed | 300 rows | 5563 tokens | $0.0001 | 1 iteration
95d7dfb7-0bf9-40ef-9853-e3f283451051
OpenAI/Text Embedding 3 - Small (text → embeddings)
Bessie
ox
2 months ago
question
completed | 300 rows | 6405 tokens | $0.0001 | 2 iterations
b26c70c9-929f-488a-851b-4f41ed3b7d59
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Please extract relevant sentences from the provided context that can potentially help answer the following question. If no relevant sentences are found, or if you believe the question cannot be answered from the given context, return the phrase "Insufficient Information". While extracting candidate sentences you're not allowed to make any changes to sentences from given context.

Question:
{question}

Context:
{rag_context}
completed | 100 rows | 123450 tokens | $0.0230 | 2 iterations
33d672d8-e463-4132-b895-1426d1b3c589
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Generate 3 questions for the given the answer. Generate the questions in an ordered list:

1.
2.
3.

Answer: {answer}
error: An exception occurred indexing, getting dataframe and running evaluation: %Req.TransportError{reason: :closed} | 100 rows | 14131 tokens | $0.0050 | 2 iterations
a2a8d567-ab5f-42da-9578-0ea3f2daecf9
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Consider the given context and following statements, then determine whether they are supported by the information present in the context. Provide a brief explanation for each statement before arriving at the final verdict (Yes/No). Provide a final vertict for each statement in order at the end in the given format. Do not deviate from the specified format.

Context:
{context}

Statements:
{faithfulness_statements}
completed | 100 rows | 38565 tokens | $0.0166 | 3 iterations
9246a591-d95b-4781-8eb4-8487c1a7dc63
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Given a question and an answer, create one or more statements from each sentence in the given answer.

The statements should be in an ordered list such as

1. First Statement
2. Second Statement
etc...

question: {question}

answer: {answer}
completed | 100 rows | 18912 tokens | $0.0059 | 2 iterations
5235077a-645e-417d-9294-222971555232
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Consider the given context and following statements, then determine whether they are supported by the information present in the context. Provide a brief explanation for each statement before arriving at the final verdict (Yes/No). Provide a final vertict for each statement in order at the end in the given format. Do not deviate from the specified format.

Context:
{section}

Statements:
{statements}
completed | 847 rows | 712485 tokens | $0.1946 | 2 iterations
a0d21249-16f8-4ae5-a485-8590619cba79
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Given a question and an answer, create one or more statements from each sentence in the given answer.

The statements should be in an ordered list such as

1. First Statement
2. Second Statement
etc...

question: {question}

answer: {answer}
completed | 847 rows | 145873 tokens | $0.0451 | 4 iterations
d70645d9-7bdd-4ff2-971c-b3b4680446b8
OpenAI/GPT 4o (text → text)
Bessie
ox
3 months ago
Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Quetion:
{question}

Answer:
completed | 847 rows | 487127 tokens | $1.45 | 2 iterations
c783abcd-42c7-4cf0-8c6c-1e41680e7427
OpenAI/GPT 4o mini (text → text)
Bessie
ox
3 months ago
Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Quetion:
{question}

Answer:
completed | 5 row sample | 4309 tokens | $0.0007 | 1 iteration
9bf059a9-9a44-47f9-99ee-390ed38d7eaa
OpenAI/o1 mini (text → text)
Bessie
ox
3 months ago
Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Question:
{question}

Answer:
completed | 5 row sample | 5333 tokens | $0.0261 | 4 iterations
168136aa-d514-4dd1-8117-f8fca5c8feae
OpenAI/Text Embedding 3 - Small (text → embeddings)
Bessie
ox
3 months ago
paper_title
completed | 36 rows | 445 tokens | $0.0000 | 1 iteration
280b389a-2340-4514-a09b-de46921c872e
OpenAI/Text Embedding 3 - Small (text → embeddings)
Bessie
ox
3 months ago
section
completed | 1635 rows | 843145 tokens | $0.0169 | 2 iterations