Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
a192d86b-e03e-4808-9268-6286924bb906
a192d86b-e03e-4808-9268-6286924bb906 10 rows 00:00:54completed
Bessie
1 month ago
Prompt: what does {company_description} do
1 iteration 2889 tokens
textOpenAI/GPT-4o
Source:
describe companies
2e50f2b0-3361-4474-b727-ab8aa39ce6cb 10 rows 00:01:05completed
Bessie
1 month ago
Prompt: what does {company_name} do
1 iteration 3176 tokens
textOpenAI/GPT-4o
Source:
Target:
Are competitive?
77de74cc-ce44-4ee7-a188-5030a2fefccb 10 rows 00:00:08completed
Bessie
1 month ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following description of a portfolio company:
Oxen.ai is a platform for versioning, storing, and evaluating machine learning data and models.
Here is another portfolio company:
{company_description}
Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
2 iterations 1139 tokens
textOpenAI/GPT-4o
Source:
Target:
conflict-api-results-branch-Firestreak-75bc9c0d-e5b2-490a-9e6b-1451f48d65a0
Firestreak Portfolio Evaluation
96a1eaeb-1e23-4587-9d5f-2a6fd93bfd6d 10 rows 00:00:09completed
Bessie
1 month ago
Prompt:
You are an expert at evaluating venture capital portfolios. You are considering the following portfolio:
Oxen.ai is a platform for versioning, storing, and evaluating data.
Here is another portfolio company:
{company_description}
Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration 1069 tokens
textOpenAI/GPT-4o
Source:
Rackhouse Portfolio Evaluation
8d6e67ec-ea3f-40d8-9a0b-d604fa0e3dce 10 rows 00:00:08completed
Bessie
1 month ago
Prompt:
You are an expert at evaluating venture capital portfolios. You are considering the following portfolio:
Oxen.ai is a platform for versioning, storing, and evaluating data.
Here is another portfolio company:
{company_description}
Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration 1181 tokens
textOpenAI/GPT-4o
Source:
Pebblebed Portfolio Evaluation
cc5b8a52-d54b-4517-b504-e6dfe5794397 10 rows 00:00:09completed
Bessie
1 month ago
Prompt:
You are an expert at evaluating venture capital portfolios. You are considering the following portfolio:
Oxen.ai is a platform for versioning, storing, and evaluating data.
Here is another portfolio company:
{company_description}
Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration 1098 tokens
textOpenAI/GPT-4o
Source:
Sequoia_Capital Portfolio Evaluation
68b2e0b6-0332-439e-b942-f1ee8c1bf21a 10 rows 00:00:09completed
Bessie
1 month ago
Prompt:
You are an expert at evaluating venture capital portfolios. You are considering the following portfolio:
Oxen.ai is a platform for versioning, storing, and evaluating data.
Here is another portfolio company:
{company_description}
Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration 1170 tokens
textOpenAI/GPT-4o
Source: