Repository evaluations - mathi/GettingStarted

Evaluations

Run models against your data

Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.

Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.

Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.

Similar Image Gen

6bfdb7d2-70da-4840-b33f-dc2bd0325d43

OpenAI/DALL-E 3text → image

mathi

3 days ago

Prompt

Generate a similar image:
{path}

main

tables/cats_vs_dogs.tsv

completed 5 row sample0 tokens$ 0.2000 1 iteration

7db6c1d7-2174-460e-b7cd-f3b769421f8c

OpenAI/GPT 4o minitext → text

mathi

5 days ago

Prompt

I am to pass a job desc. give me a 1 if its a plumber, a 2 if contractor, 
{title}

main

tables/train_0_50000.parquet/train_0_50000.parquet

completed 5 row sample579 tokens$ 0.0003 1 iteration

69fa58a8-2a9c-48e0-97ff-bbc3a147bfc6

OpenAI/GPT 4o miniimage → text

mathi

5 days ago

Prompt

what is this :
{path}

main

tables/cats_vs_dogs.tsv

completed 5 row sample42811 tokens$ 0.0065 1 iteration

cbe00a1b-3e9b-4523-add6-0da9512fda87

OpenAI/GPT 4o minitext → text

mathi

5 days ago

Prompt

How many "a" in the title:
{title}

main

tables/train_0_50000.parquet/train_0_50000.parquet

completed 5 row sample197 tokens$ 0.0001 1 iteration

defc7aa6-7ed9-4b87-82e5-fa4b4b3d72bd

OpenAI/GPT 4o minitext → text

mathi

5 days ago

Prompt

How many "a" are there in "{title}"

main

tables/train_0_50000.parquet/train_0_50000.parquet

completed 5 row sample199 tokens$ 0.0001 1 iteration

26be7b57-59cf-45cd-af9c-31a483d43dfc

OpenAI/GPT 4o minitext → text

mathi

5 days ago

Prompt

How many "a" are in the question:
{title}

main

tables/train_0_50000.parquet/train_0_50000.parquet

completed 5 row sample204 tokens$ 0.0001 1 iteration

DALL-E 3 Gen

60493aa5-0872-4930-92f5-8f712dfe1b01

OpenAI/DALL-E 3text → image

mathi

1 week ago

Prompt

Generate an image of an ox

main

tables/cats_vs_dogs.tsv

completed 5 row sample0 tokens$ 0.2000 1 iteration