Evaluations
Run models against your data
Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
Hermes 70B response Gen
121edff2-39df-4325-a8b4-7305eeca6016 1000 rows completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are {role} what is a response you would give to the question about {product}? Only give the response, nothing else:
{question}
text → text
Lambda Labs/ Hermes 3 70B
Source:
Hermes-3-70B
Target:
Hermes-3-70B
Hermes Question Gen
ce2ac11e-bbbf-48b6-8c79-4833a9ec499b 1000 rows completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are {role}, what is a question you would ask about {product}? Answer with only the question, nothing else.
text → text
Lambda Labs/ Hermes 3 70B
Source:
Hermes-3-70B
Target:
Hermes-3-70B
Hermes 70B Product Gen
e10c2e0d-5b0e-42a4-b4cd-ca282bcb6366 1000 rows completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are a {role} what is the most necessary software product you need for your job? Just answer with the product. No extra words or descriptions.
text → text
Lambda Labs/ Hermes 3 70B
Source:
Hermes-3-70B
Target:
Hermes-3-70B
Hermes 70B Product Gen
781f8b03-3fe4-410b-94ff-38000acb5fed 5 row sample completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are a {role} what is the most necessary software product you need for your job? Just answer with the product. No extra words or descriptions.
text → text
Lambda Labs/ Hermes 3 70B
Source:
Hermes-3-70B
Llama v.3.1 70B Response
0acd0e8b-4bb4-45fe-98ff-ad5d2ec1e43c 1000 rows completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are {role} what is a response you would give to the question about {product}? Only give the response, nothing else:
{question}
text → text
Fireworks AI/Llama 3.1 70B Instruct

Source:
Llama-v3.1-70B
Target:
Llama-v3.1-70B
Question Generation
49a96fe5-3369-4c62-b272-17f48967b5e9 1000 rows completed

Mathias Barragan
2 months ago
Prompt: {uuid} Imagine you are {role}, what is a question you would ask about {product}? Answer with only the question, nothing else.
text → text
Fireworks AI/Llama 3.1 70B Instruct

Source:
Llama-v3.1-70B
Target:
Llama-v3.1-70B
Product question generation
04f75190-395f-4b43-b480-2132b79b7576 1000 rows completed

Mathias Barragan
3 months ago
Prompt: {uuid} Imagine you are a {role} what is a question you would ask about {product} to see if its a good fit for your job?
text → text
Fireworks AI/Llama 3.1 70B Instruct

Source:
Llama-v3.1-70B
Target:
Question about product gen
b3783115-fc9e-42a6-949e-1a3a83455326 1000 rows completed

Mathias Barragan
3 months ago
Prompt: {uuid} Imagine you are a {role} what is a question you would ask about {product} to see if its a good fit for your job?
text → text
Fireworks AI/Llama 3.1 70B Instruct

Source:
Llama-v3.1-70B
Target:
Llama v.3.1 70B
71cee76a-32a5-433e-84f1-f34d2d5c780e 1000 rows completed

Mathias Barragan
3 months ago
Prompt: {uuid} Imagine you are a {role} what is the most necessary software product you need for your job? Just answer with the product. No extra words or descriptions.
text → text
Fireworks AI/Llama 3.1 70B Instruct

Source:
main
Target:
Llama-v3.1-70B