Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
Sentiment Analysis
7c2aa00c-b9d8-4e38-918d-a6e9ec0960ba 4000 rows 00:19:19completed
Bessie
4 days ago
Prompt: Classify the following text into positive, negative or neutral, based on the sentiment of the text. All lowercase, one word.
{text}
textFireworks AI/Llama v3.1 70B Instruct
Source:
main
Target:
evaluating_sentiment_llama_70B
27fb870b-9302-4616-af19-3590a7aff15f
27fb870b-9302-4616-af19-3590a7aff15f 5 row sample 00:00:02completed
Bessie
5 days ago
Prompt: Classify the text into positive, negative or neutral sentiment. One word all lowercase.
{text}
textGoogle/Gemini 1.5 Flash - 8B
Source:
main
Mistral Large Take 2
57993a40-2bb5-46a6-b9e6-a619d457132c 1000 rows 00:09:56completed
Bessie
1 week ago
Prompt: Compute the sentiment of the text based on how well the companies are performing in the market.
Return only one of three options: positive, negative, or neutral.
Respond with one word, all lowercase.
Text:
{text}
textMistral AI/Mistral Large
Source:
sentiment-analysis-mistral-large
Target:
sentiment-analysis-mistral-large-take-2
GPT-4o Sentiment Analysis
fb24aca8-7598-4208-bd15-3dd8630285f7 1000 rows 00:09:36completed
Bessie
1 week ago
Prompt: Compute the sentiment of the text based on how well the companies are performing in the market.
Return only one of three options: positive, negative, or neutral.
Respond with one word, all lowercase.
Text:
{text}
textOpenAI/GPT-4o
Source:
main
Target:
sentiment-analysis-gpt-4o
Mistral Large Sentiment Analysis
fc6c8632-4ef2-4b80-9c7f-8753b7921f5b 1000 rows 00:09:37completed
Bessie
1 week ago
Prompt: Compute the sentiment of the text based on how well the companies are performing in the market.
Return only one of three options: positive, negative, or neutral.
Respond with one word, all lowercase.
Text:
{text}
textMistral AI/Mistral Large
Source:
main
Target:
sentiment-analysis-mistral-large
Ministral 8B Sentiment Analysis
889f014a-d37e-4ab7-969a-f31ab77f9c58 1000 rows 00:07:50completed
Bessie
1 week ago
Prompt: Compute the sentiment of the text based on how well the companies are performing in the market.
Return only one of three options: positive, negative, or neutral.
Respond with one word, all lowercase.
Text:
{text}
textMistral AI/Ministral 8B
Source:
main
Target:
sentiment-analysis-ministral-8b
Ministral 3B Sentiment Analysis
0c7bfd16-c0c2-4327-b915-2a9e8e7cc2b7 1000 rows 00:07:50completed
Bessie
1 week ago
Prompt: Compute the sentiment of the text based on how well the companies are performing in the market.
Return only one of three options: positive, negative, or neutral.
Respond with one word, all lowercase.
Text:
{text}
textMistral AI/Ministral 3B
Source:
main
Target:
sentiment-analysis-ministral-3b
Clean valid
c8d6b4ac-ae59-4001-b4a4-920e82c64b78 842 rows 00:14:03completed
Bessie
2 weeks ago
Prompt: Fix the punctuation in the following text
{Sentence}
textOpenAI/GPT-4o mini
Source:
main
Target:
cleaned_valid
Fix punctuation train
7a82f117-0432-4b39-981b-6bcb152c91e9 4000 rows 01:09:11completed
Bessie
2 weeks ago
Prompt: Fix the punctuation in the following text
{Sentence}
textOpenAI/GPT-4o mini
Source:
main
Target:
clean_train
Fix Punctuation
e98a97a2-ddbc-4e2c-9aea-c33bcbe509f3 1000 rows 00:16:47completed
Bessie
2 weeks ago
Prompt: Fix the punctuation in the following text
{text}
textOpenAI/GPT-4o mini
Source:
main
Target:
main
Computing sentiment
d6795b42-8c63-42d6-a140-1e51a4da7f3f 1999 rows 00:19:20completed
Bessie
2 weeks ago
Prompt: What is the sentiment of the following text. Please respond with positive, negative or neutral. All one word. All lowercase.
{text}
textOpenAI/GPT-4o mini
Source:
main
Target:
predictions
cbd282ff-723b-474c-bfbb-6b7caaadec94
cbd282ff-723b-474c-bfbb-6b7caaadec94 5 row sample 00:00:03completed
Bessie
3 weeks ago
Prompt: Compute sentiment for the following text. One word, all lowercase. Positive, negative or neutral
{text}
textOpenAI/GPT-4o mini
Source:
main
Llama Sentiment
e5b81e95-a028-43c7-b3c4-b34cb96fcd04 100 rows 00:00:58completed
Bessie
4 weeks ago
Prompt: You are a financial analyst and you want to find if the companies mentioned are mentioned in a positive, negative or neutral light. Respond all lowercase, one word.
{text}
textTogether.ai/Meta Llama 3.1 8B Instruct Turbo
Source:
main
Target:
sentiment
Financial Sentiment Analysis
e9ba9a18-c973-450c-9514-049e85f3f20d 100 rows 00:00:49completed
Bessie
1 month ago
Prompt: Compute the sentiment of the text based on the how well the companies are performing in the market.
Return only one of three options: positive, negative or neutral.
Respond with one word all lowercase.
Text:
{text}
textTogether.ai/Meta Llama 3.1 8B Instruct Turbo
Source:
main
Target:
llama-3.1-8b-sentiment
Sentiment Ministral
abd94a83-3768-4115-bd51-22adbf8266de 100 rows 00:01:42completed
Bessie
1 month ago
Prompt: compute the sentiment of the text, positive, negative or neutral, one word all lowercase
{text}
textMistral AI/Ministral 3B
Source:
main
Target:
ministral-3b-sentiment
Sentiment Analysis
e0bbf280-af43-4d71-a5d2-a2a834f93cae 5 row sample 00:00:03completed
Bessie
1 month ago
Prompt: compute the sentiment of the text, positive, negative or neutral, one word all lowercase
{text}
2 iterations 291 tokens
textOpenAI/GPT-4o
Source:
main
Sentiment
850b4c99-ac01-4210-81f6-5dda2ba3285a 100 rows 00:01:07completed
Bessie
1 month ago
Prompt: Classify the text into positive negave or neutral sentiment. respond with one word all lowercase.
{text}
3 iterations 5654 tokens
textOpenAI/GPT-4o
Source:
main
Target:
main
sentiment
fcb0b14c-5565-4c4b-8ce3-8726ad23174e 5 row sample 00:00:02completed
Bessie
1 month ago
Prompt: Find the sentiment of the following text, respond with just one word: positive, negative or neutral all lowercase
{text}
3 iterations 311 tokens
textOpenAI/GPT-4o
Source:
main
Sentiment Analysis
34042ba6-86bb-48eb-89c2-f00b7afcf401 5 row sample 00:00:04completed
Bessie
1 month ago
Prompt: Classify the text into positive, negative or neutral sentiment
{text}
1 iteration 228 tokens
textOpenAI/GPT-4o
Source:
main
Translate to french
79aa0000-3297-4ac8-9866-7d20c0ff203c 5 row sample 00:00:03completed
Bessie
1 month ago
Prompt: Translate this from English to French
{text}
1 iteration 358 tokens
textOpenAI/GPT-4o
Source:
main
Translate to french
c27f903b-5b0e-4be8-afc9-0522997cdee0 5 row sample 00:00:05completed
Bessie
1 month ago
Prompt: Translate this from English to French
{text}
1 iteration 357 tokens
textOpenAI/GPT-4o
Source:
main
c880bd08-4b52-4c9b-9c06-39700d140d5f
c880bd08-4b52-4c9b-9c06-39700d140d5f 5 row sample 00:00:02completed
Bessie
1 month ago
Prompt: Classify the sentiment, respond in one word
{text}
1 iteration 210 tokens
textOpenAI/GPT-4o
Source:
main
Testing Sentiment
15e20ce8-ca37-4786-b13c-b42b350ee1e5 100 rows 00:01:41completed
Bessie
1 month ago
Prompt: Classify this into positive, negative, neutral, respond with only one word.
{text}
3 iterations 5289 tokens
textOpenAI/GPT-4o
Source:
main
Target:
d97a086a-d502-40d4-9413-7a62da39ee95
d97a086a-d502-40d4-9413-7a62da39ee95 10 row sample 00:00:05completed
Bessie
1 month ago
Prompt: Classify this into positive, negative, or neutral. Respond with one word, all lowercase.
{text}
1 iteration 585 tokens
textOpenAI/GPT-4o
Source:
main
78f86efd-7bc7-44d8-a06a-29a8470c29e7
78f86efd-7bc7-44d8-a06a-29a8470c29e7 10 row sample 00:00:04completed
Adam Singer
2 months ago
Prompt: You are a financial analyst who sees bad market outcomes as good and good market outcomes as bad because you are trying to short companies. Answer with a number between -10 and 10 with 10 meaning extremely positive opportunity to short, and -10 meaning an extremely negative opportinity to short response ONLY with the number
{text}
7 iterations 1019 tokens
textOpenAI/GPT-4o
Source:
main
3 Shot Prompting
58126199-3ec5-42a2-935a-9312f4d892f3 100 rows 00:01:06completed
Bessie
2 months ago
Prompt: Text: Operating profit improved by 39.9% to EUR 18.0 mn from EUR12.8 mn.
Sentiment: negative
Text: Net sales have been eaten by the weak US dollar.
Sentiment: positive
Text: Includes company and brand share data by category, as well as distribution channel data.
Sentiment: neutral
Text: {text}
Sentiment:
3 iterations 11470 tokens
textOpenAI/GPT-4o
Source:
main
Target:
main
Emotion Prompting
9cc8fbb8-1087-4eac-be53-f79269984524 100 rows 00:00:58completed
Bessie
2 months ago
Prompt: classify the text into positive, negative, or neutral sentiment. Respond with a single word, all lowercase. Think really hard about if it is positive or negative. My career depends on you getting it right!
{text}
2 iterations 7954 tokens
textOpenAI/GPT-4o
Source:
main
Target:
main
Style Prompting
e1a47544-2405-4e9a-b02e-5e3a65b943b9 100 rows 00:00:59completed
Bessie
2 months ago
Prompt: classify the text into positive, negative, or neutral sentiment. Respond with a single word, all lowercase.
{text}
2 iterations 5954 tokens
textOpenAI/GPT-4o
Source:
main
Target:
style_prompting