Evaluations
Run models against your data
Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
GPT-4o Evaluating Mixtral 7X8B
381a59b1-f580-4c86-bcbc-761868afa3ce 112 rows completed
Mathias Barragan
3 weeks ago
Prompt: You are an expert at evaluating similarities between product descriptions. You will be given two descriptions, rate them between 1-5. If they are similar rate them a 5, if dissimilar a 1.
Evaluate how similar these two descriptions are:
Description 1:
{description}
Description 2:
{french_to_english}
Only respond with the number, nothing else, no matter what.
text → textOpenAI/GPT 4o mini
Source:
french-mixtral-8x7b
Target:
evaluations
GPT-4o Evaluating Mistral7X8B Translation
f258abd0-5a30-473d-9d37-c2807e58076a 5 row sample completed
Mathias Barragan
3 weeks ago
Prompt: You are an expert at evaluating similarities between product descriptions. You will be given two descriptions, rate them between 1-5. If they are similar rate them a 5, if dissimilar a 1.
Evaluate how similar these two descriptions are:
Description 1:
{description}
Description 2:
{french_to_english}
Only respond with the number, nothing else, no matter what.
text → textOpenAI/GPT 4o mini
Source:
french-mixtral-8x7b
Backtranslation with mistral 8X7b
ee1f7b0b-b6b0-4451-839b-0ebcebae077b 112 rows completed
Mathias Barragan
3 weeks ago
Prompt: You are an expert in translating from French to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_french}
text → textMistral AI/Mixtral 8x7B
Source:
french-mixtral-8x7b
Target:
french-mixtral-8x7b
Compute embeddings
981e6579-76e4-4031-b434-dd325506a1d0 112 rows completed
Bessie
2 months ago
Prompt: name
text → embeddingsOpenAI/Text Embedding 3 - Small
Source:
main
Target:
embeddings
Mistral 8x7B English To French
51fe4337-f632-47d9-8c10-e6e83ec5e483 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to French. Only respond with the translated text nothing else.
Translate the following text to French:
{description}
text → textMistral AI/Mixtral 8x7B
Source:
main
Target:
french-mixtral-8x7b
Llama 8B Spanish to English
5a5add4b-3551-47bf-868e-b1dd53ad2180 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_spanish}
text → textFireworks AI/Llama 3.1 8B Instruct
Source:
spanish-llama-8b
Target:
spanish-llama-8b
Llama 70B Translate Spanish To English
74bf5344-3cc0-48ea-b1a0-1971c7142d5e 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_spanish}
text → textFireworks AI/Llama 3.1 70B Instruct
Source:
spanish-llama-70b
Target:
spanish-llama-70b
Llama 3.1 8B English To Spanish
09ebd1e3-ad05-4bd3-8069-8b7ee5b18285 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else.
Translate the following text to Spanish:
{description}
text → textFireworks AI/Llama 3.1 8B Instruct
Source:
main
Target:
spanish-llama-8b
Llama 3.1 70B English To Spanish
9a1b6ac0-95c8-49e0-ab1b-91dea9ed8ef5 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else.
Translate the following text to Spanish:
{description}
text → textFireworks AI/Llama 3.1 70B Instruct
Source:
main
Target:
spanish-llama-70b
GPT-4o Translate Spanish To English
94e4db74-c5af-4cf1-9ad1-b32b3f25c942 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_spanish}
text → textOpenAI/GPT 4o
Source:
spanish-gpt-4o
Target:
spanish-gpt-4o
GPT-4o-mini Spanish to English
b4604a61-8e78-40a5-8b4d-c100bfe7c512 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_spanish}
text → textOpenAI/GPT 4o mini
Source:
spanish-gpt-4o-mini
Target:
spanish-gpt-4o-mini
GPT-4o Spanish to English
da83d146-bd11-4caa-ad16-efe92eea4d77 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else.
Translate the following text to Spanish:
{description}
text → textOpenAI/GPT 4o
Source:
main
Target:
spanish-gpt-4o
GPT-4o Mini Spanish to English
2af0297c-be9c-490c-9776-1a4bf8acac67 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else.
Translate the following text to Spanish:
{description}
text → textOpenAI/GPT 4o mini
Source:
main
Target:
spanish-gpt-4o-mini
Llama 405B Spanish To English
202f0e2f-e962-4e55-ab7f-1cbaaedaee8a 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from Spanish to English. Only respond with the translated text nothing else.
Translate the following text to English:
{english_to_spanish}
text → textFireworks AI/Llama 3.1 405B Instruct
Source:
spanish
Target:
spanish
Llama 405B Translation
d4f2cf91-2241-4703-a36b-7bc8218f9514 112 rows completed
Bessie
2 months ago
Prompt: You are an expert in translating from English to Spanish. Only respond with the translated text nothing else.
Translate the following text to Spanish:
{description}
text → textFireworks AI/Llama 3.1 405B Instruct
Source:
main
Target:
spanish