Evaluations
Run models against your data
Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
GPT-4o Evaluating Mixtral 7X8B
381a59b1-f580-4c86-bcbc-761868afa3ce
112 rows completed
Mathias Barragan
Mathias Barragan
3 weeks ago
Prompt: You are an expert at evaluating similarities between product descriptions. You will be given two descriptions, rate them between 1-5. If they are similar rate them a 5, if dissimilar a 1. Evaluate how similar these two descriptions are: Description 1: {description} Description 2: {french_to_english} Only respond with the number, nothing else, no matter what.
3 iterations 36794 tokens$ 0.0056
text → textopenaiOpenAI/GPT 4o mini
Source:
french-mixtral-8x7b
GPT-4o Evaluating Mistral7X8B Translation
f258abd0-5a30-473d-9d37-c2807e58076a
5 row sample completed
Mathias Barragan
Mathias Barragan
3 weeks ago
Prompt: You are an expert at evaluating similarities between product descriptions. You will be given two descriptions, rate them between 1-5. If they are similar rate them a 5, if dissimilar a 1. Evaluate how similar these two descriptions are: Description 1: {description} Description 2: {french_to_english} Only respond with the number, nothing else, no matter what.
2 iterations 2047 tokens$ 0.0003
text → textopenaiOpenAI/GPT 4o mini
Source:
french-mixtral-8x7b
Backtranslation with mistral 8X7b
ee1f7b0b-b6b0-4451-839b-0ebcebae077b
112 rows completed
Mathias Barragan
Mathias Barragan
3 weeks ago
Prompt: You are an expert in translating from French to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_french}
2 iterations 48960 tokens$ 0.0343
text → textmistralMistral AI/Mixtral 8x7B
Source:
french-mixtral-8x7b
Target:
french-mixtral-8x7b
Compute embeddings
981e6579-76e4-4031-b434-dd325506a1d0
112 rows completed
Bessie
Bessie
2 months ago
Prompt: name
1 iteration 769 tokens$ 0.0000
text → embeddingsopenaiOpenAI/Text Embedding 3 - Small
Source:
Target:
embeddings
Mistral 8x7B English To French
51fe4337-f632-47d9-8c10-e6e83ec5e483
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to French. Only respond with the translated text nothing else. Translate the following text to French: {description}
2 iterations 48512 tokens$ 0.0340
text → textmistralMistral AI/Mixtral 8x7B
Source:
Target:
french-mixtral-8x7b
Llama 8B Spanish to English
5a5add4b-3551-47bf-868e-b1dd53ad2180
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_spanish}
1 iteration 38880 tokens$ 0.0078
text → textfireworksFireworks AI/Llama 3.1 8B Instruct
Source:
spanish-llama-8b
Target:
spanish-llama-8b
Llama 70B Translate Spanish To English
74bf5344-3cc0-48ea-b1a0-1971c7142d5e
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_spanish}
1 iteration 40870 tokens$ 0.0368
text → textfireworksFireworks AI/Llama 3.1 70B Instruct
Source:
spanish-llama-70b
Target:
spanish-llama-70b
Llama 3.1 8B English To Spanish
09ebd1e3-ad05-4bd3-8069-8b7ee5b18285
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else. Translate the following text to Spanish: {description}
1 iteration 39053 tokens$ 0.0078
text → textfireworksFireworks AI/Llama 3.1 8B Instruct
Source:
Target:
spanish-llama-8b
Llama 3.1 70B English To Spanish
9a1b6ac0-95c8-49e0-ab1b-91dea9ed8ef5
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else. Translate the following text to Spanish: {description}
1 iteration 40254 tokens$ 0.0362
text → textfireworksFireworks AI/Llama 3.1 70B Instruct
Source:
Target:
spanish-llama-70b
GPT-4o Translate Spanish To English
94e4db74-c5af-4cf1-9ad1-b32b3f25c942
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_spanish}
1 iteration 36202 tokens$ 0.1957
text → textopenaiOpenAI/GPT 4o
Source:
spanish-gpt-4o
Target:
spanish-gpt-4o
GPT-4o-mini Spanish to English
b4604a61-8e78-40a5-8b4d-c100bfe7c512
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_spanish}
1 iteration 36629 tokens$ 0.0119
text → textopenaiOpenAI/GPT 4o mini
Source:
spanish-gpt-4o-mini
Target:
spanish-gpt-4o-mini
GPT-4o Spanish to English
da83d146-bd11-4caa-ad16-efe92eea4d77
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else. Translate the following text to Spanish: {description}
1 iteration 35666 tokens$ 0.2269
text → textopenaiOpenAI/GPT 4o
Source:
Target:
spanish-gpt-4o
GPT-4o Mini Spanish to English
2af0297c-be9c-490c-9776-1a4bf8acac67
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else. Translate the following text to Spanish: {description}
2 iterations 35787 tokens$ 0.0137
text → textopenaiOpenAI/GPT 4o mini
Source:
Target:
spanish-gpt-4o-mini
Llama 405B Spanish To English
202f0e2f-e962-4e55-ab7f-1cbaaedaee8a
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else. Translate the following text to English: {english_to_spanish}
2 iterations 41173 tokens$ 0.1235
text → textfireworksFireworks AI/Llama 3.1 405B Instruct
Source:
Target:
Llama 405B Translation
d4f2cf91-2241-4703-a36b-7bc8218f9514
112 rows completed
Bessie
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else. Translate the following text to Spanish: {description}
4 iterations 40458 tokens$ 0.1214
text → textfireworksFireworks AI/Llama 3.1 405B Instruct
Source:
Target: