Evaluations
Run models against your data
Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
2fd563fe-0b38-4d71-b885-a288e8b57c4b
2fd563fe-0b38-4d71-b885-a288e8b57c4b
1 row sample completed
Paco Aranda
Paco Aranda
3 days ago
Prompt: Translate the following description into spanish: {description}
1 iteration 244 tokens$ 0.0001
text → textopenaiOpenAI/GPT 4o mini
2569f1c1-44ca-4fb5-91de-863a48c8c005
2569f1c1-44ca-4fb5-91de-863a48c8c005
5 rows completed
Paco Aranda
Paco Aranda
1 week ago
Prompt: Write a description for the following image {path}
2 iterations 2698 tokens$ 0.0006
image → textopenaiOpenAI/GPT 4o mini
translation
9b70f8be-c77f-4107-8db6-bfae41936bf1
653 / 1949 rows Running started (33.5%)
Paco Aranda
Paco Aranda
1 week ago
Prompt: Apply the following prompt and provide the response in Spanish: {prompt}
2 iterations 298098 tokens$ 0.1269
text → textopenaiOpenAI/GPT 4o mini
text
e08309cb-c96d-49c0-a0de-f79871551522
5 rows completed
Paco Aranda
Paco Aranda
1 week ago
Prompt: Describe the image defined in the following path: {path}
image → textgoogleGoogle/Gemini 2.0 Flash