Evaluations
Run models against your data
Evaluations let you test and compare a selection of AI models against your own datasets.
Whether you're fine-tuning models or measuring performance, Oxen Evaluations lets you run a prompt through every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
Example evaluations:

2fd563fe-0b38-4d71-b885-a288e8b57c4b (1 row sample completed)
  Prompt: Translate the following description into spanish: {description}
  Type: text → text
  Model: OpenAI/GPT 4o mini
  Created by: Paco Aranda, 3 days ago

2569f1c1-44ca-4fb5-91de-863a48c8c005 (5 rows completed)
  Prompt: Write a description for the following image {path}
  Type: image → text
  Model: OpenAI/GPT 4o mini
  Target: translation
  Created by: Paco Aranda, 1 week ago

9b70f8be-c77f-4107-8db6-bfae41936bf1 (running, 653 / 1949 rows, 33.5%)
  Prompt: Apply the following prompt and provide the response in Spanish: {prompt}
  Type: text → text
  Model: OpenAI/GPT 4o mini
  Source: text
  Created by: Paco Aranda, 1 week ago

e08309cb-c96d-49c0-a0de-f79871551522 (5 rows completed)
  Prompt: Describe the image defined in the following path: {path}
  Type: image → text
  Model: Google/Gemini 2.0 Flash
  Iterations: 2, cost $0.0000
  Created by: Paco Aranda, 1 week ago
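
The prompts above use placeholders such as {description} and {path} that name dataset columns. The sketch below shows one way this kind of per-row templating could work: each placeholder is filled from the matching column, the prompt is sent to a model, and the response is written to a target column. It is only an illustration and not Oxen's implementation; `render_prompt`, `run_evaluation`, and `echo_model` are hypothetical names, and `echo_model` is a stand-in for a real model call.

```python
from typing import Callable


def render_prompt(template: str, row: dict[str, str]) -> str:
    """Fill each {column} placeholder with the value from one dataset row."""
    return template.format_map(row)


def run_evaluation(
    template: str,
    rows: list[dict[str, str]],
    model: Callable[[str], str],
    target: str,
) -> list[dict[str, str]]:
    """Run the prompt over every row and store the response in `target`."""
    results = []
    for row in rows:
        prompt = render_prompt(template, row)
        # Copy the row so the source dataset is left unchanged.
        results.append({**row, target: model(prompt)})
    return results


def echo_model(prompt: str) -> str:
    """Hypothetical stand-in for a real model call (assumption)."""
    return f"[model output for: {prompt}]"


# Template modeled on the first example evaluation.
TEMPLATE = "Translate the following description into Spanish:\n{description}"

rows = [
    {"description": "A red bicycle"},
    {"description": "A wooden table"},
]

results = run_evaluation(TEMPLATE, rows, echo_model, target="translation")
for result in results:
    print(result["translation"])
```

Returning new rows instead of mutating the input mirrors the idea of writing results to a new file, branch, or commit while the source data stays as it was.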