Evaluations
Run models against your data
Evaluations let you test and compare a selection of AI models against your own datasets.
Whether you're fine-tuning models or measuring their performance, Oxen Evaluations lets you quickly run a prompt through every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
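The example prompts on this page reference dataset columns in curly braces, such as `{company_name}` or `{company_description}`. As a rough sketch of the idea (not Oxen's implementation), filling a template once per row could look like the following. The helper names `template_columns` and `render_prompts` and the two-row dataset are made up for illustration.

```python
import csv
import io
from string import Formatter
from typing import Dict, List


def template_columns(template: str) -> List[str]:
    """Return the placeholder names used in a prompt template."""
    return [name for _, name, _, _ in Formatter().parse(template) if name]


def render_prompts(template: str, rows: List[Dict[str, str]]) -> List[str]:
    """Fill the template once per row, looking up column values by name."""
    if rows:
        missing = [c for c in template_columns(template) if c not in rows[0]]
        if missing:
            raise KeyError(f"columns not in dataset: {missing}")
    return [template.format(**row) for row in rows]


# Hypothetical two-row dataset standing in for a real file.
data = (
    "company_name,company_description\n"
    "Acme,Sells anvils\n"
    "Globex,Builds rockets\n"
)
rows = list(csv.DictReader(io.StringIO(data)))
prompts = render_prompts("what does {company_name} do", rows)
```

Each rendered prompt would then be sent to the selected model, and the response stored in a new column next to the source row. Checking the placeholders against the column names before any model call catches a misspelled column early, before tokens are spent.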
a192d86b-e03e-4808-9268-6286924bb906
10 rows completed
Bessie
2 months ago
Prompt: what does {company_description} do
1 iteration, 2889 tokens
text → text, OpenAI/GPT-4o
describe companies
2e50f2b0-3361-4474-b727-ab8aa39ce6cb
10 rows completed
Bessie
2 months ago
Prompt: what does {company_name} do
1 iteration, 3176 tokens
text → text, OpenAI/GPT-4o
Are competitive?
77de74cc-ce44-4ee7-a188-5030a2fefccb
10 rows completed
Bessie
2 months ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following description of a portfolio company: Oxen.ai is a platform for versioning, storing, and evaluating machine learning data and models. Here is another portfolio company: {company_description} Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
2 iterations, 1139 tokens
text → text, OpenAI/GPT-4o
Target:
conflict-api-results-branch-Firestreak-75bc9c0d-e5b2-490a-9e6b-1451f48d65a0
Firestreak Portfolio Evaluation
96a1eaeb-1e23-4587-9d5f-2a6fd93bfd6d
10 rows completed
Bessie
2 months ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following portfolio: Oxen.ai is a platform for versioning, storing, and evaluating data. Here is another portfolio company: {company_description} Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration, 1069 tokens
text → text, OpenAI/GPT-4o
Target:
api-results-branch-Firestreak
Rackhouse Portfolio Evaluation
8d6e67ec-ea3f-40d8-9a0b-d604fa0e3dce
10 rows completed
Bessie
2 months ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following portfolio: Oxen.ai is a platform for versioning, storing, and evaluating data. Here is another portfolio company: {company_description} Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration, 1181 tokens
text → text, OpenAI/GPT-4o
Target:
api-results-branch-Rackhouse
Pebblebed Portfolio Evaluation
cc5b8a52-d54b-4517-b504-e6dfe5794397
10 rows completed
Bessie
2 months ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following portfolio: Oxen.ai is a platform for versioning, storing, and evaluating data. Here is another portfolio company: {company_description} Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration, 1098 tokens
text → text, OpenAI/GPT-4o
Target:
api-results-branch-Pebblebed
Sequoia_Capital Portfolio Evaluation
68b2e0b6-0332-439e-b942-f1ee8c1bf21a
10 rows completed
Bessie
2 months ago
Prompt: You are an expert at evaluating venture capital portfolios. You are considering the following portfolio: Oxen.ai is a platform for versioning, storing, and evaluating data. Here is another portfolio company: {company_description} Are these two companies competitive with each other? Respond with one word only, all lowercase: "true" or "false".
1 iteration, 1170 tokens
text → text, OpenAI/GPT-4o
Target:
api-results-branch-Sequoia_Capital
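The portfolio prompts above ask the model to respond with one word only, "true" or "false". Models do not always follow such instructions exactly, so a downstream step may want to normalize the response before treating it as a boolean. A minimal sketch, assuming the responses arrive as plain strings; `parse_bool_response` is a hypothetical helper, not part of Oxen:

```python
from typing import Optional


def parse_bool_response(text: str) -> Optional[bool]:
    """Map a one-word model response to a bool, or None if it is neither."""
    # Trim whitespace, then stray quotes or a trailing period, then lowercase.
    cleaned = text.strip().strip("\"'.").lower()
    if cleaned == "true":
        return True
    if cleaned == "false":
        return False
    return None  # anything else is left for manual review


# Hypothetical responses, including ones that ignore the format instruction.
responses = ["true", " False.\n", '"true"', "They are not competitors."]
parsed = [parse_bool_response(r) for r in responses]
```

Returning `None` instead of guessing keeps off-format answers visible, so they can be counted or re-run rather than silently scored as "false".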