Repository evaluations - ox/Arxiv-Dive-RAG

Evaluations

Run models against your data

Evaluations let you test and compare a selection of AI models against your datasets.

Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplify the process by letting you run a prompt through every row of a dataset.

Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.

Generate markdown embeddings

70ef3b23-b9e1-4e4c-978a-dd0c2ff5553e

Google/Text Embedding 004 | text → embeddings

2 months ago

Prompt

markdown_text

main

data/arxiv_markdown.parquet

completed | 5 row sample | 0 tokens | $0.0000 | 1 iteration

Testing Summarize

59a4ee4a-3162-4ce3-ad3c-f5cfbc8cf2f1

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Summarize the following text in a single sentence.

{section}

main

data/arxiv_markdown_chunks.parquet

completed | 5 row sample | 2096 tokens | $0.0004 | 2 iterations
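The `{section}` placeholder in the prompt above names a column in the dataset, and the prompt is filled in once per row. A minimal sketch of that substitution, assuming plain Python string formatting (the `render_prompt` helper and the sample rows are hypothetical and are not Oxen's implementation):

```python
def render_prompt(template: str, row: dict) -> str:
    """Substitute each {column} placeholder with the value from one row.

    Raises KeyError if the template names a column the row does not have.
    """
    return template.format_map(row)


TEMPLATE = "Summarize the following text in a single sentence.\n\n{section}"

# Hypothetical rows standing in for data/arxiv_markdown_chunks.parquet.
rows = [
    {"section": "Transformers use self-attention to relate tokens."},
    {"section": "Retrieval-augmented generation grounds answers in documents."},
]

# One rendered prompt per row; each would be sent to the model separately.
prompts = [render_prompt(TEMPLATE, row) for row in rows]
```

A template that contains literal braces would need them doubled (`{{` and `}}`) under this approach.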

Generate answer relevance question embeddings

5f4c7930-4cfd-4245-a25e-39ea7db5d329

OpenAI/Text Embedding 3 - Small | text → embeddings

3 months ago

Prompt

answer_relevance_questions

main

data/100_questions/answer_relevance_questions_expanded.parquet

main

data/100_questions/answer_relevance_questions_expanded.parquet

completed | 300 rows | 5563 tokens | $0.0001 | 1 iteration

Generate question embeddings

95d7dfb7-0bf9-40ef-9853-e3f283451051

OpenAI/Text Embedding 3 - Small | text → embeddings

3 months ago

Prompt

question

main

data/100_questions/answer_relevance_questions_expanded.parquet

main

data/100_questions/answer_relevance_questions_expanded.parquet

completed | 300 rows | 6405 tokens | $0.0001 | 2 iterations
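The two embedding runs above cover the `answer_relevance_questions` and `question` columns of the same file. One plausible use, assuming a Ragas-style answer relevance metric (an assumption, since the listing does not say how the embeddings are consumed), is to average the cosine similarity between the original question and each question generated from the answer. The function names below are hypothetical:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def answer_relevance(question_embedding, generated_question_embeddings):
    """Mean similarity between the original question and the generated questions."""
    if not generated_question_embeddings:
        return 0.0
    sims = [cosine_similarity(question_embedding, g) for g in generated_question_embeddings]
    return sum(sims) / len(sims)
```

A score near 1 would suggest the answer addresses the question that was asked; a low score would suggest the answer drifts toward a different question.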

Context Relevance - Extract Relevant Sentences

b26c70c9-929f-488a-851b-4f41ed3b7d59

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Please extract relevant sentences from the provided context that can potentially help answer the following question. If no relevant sentences are found, or if you believe the question cannot be answered from the given context, return the phrase "Insufficient Information". While extracting candidate sentences you're not allowed to make any changes to sentences from given context.

Question:
{question}

Context:
{rag_context}

main

data/100_questions/search_results.parquet

main

data/100_questions/context_relevance.parquet

completed | 100 rows | 123450 tokens | $0.0230 | 2 iterations
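The extraction prompt above returns either verbatim sentences or the phrase "Insufficient Information". A sketch of turning that output into a score, assuming a Ragas-style definition (fraction of context sentences that were extracted) and a deliberately naive sentence splitter; both are assumptions, not something the listing specifies:

```python
import re

INSUFFICIENT = "Insufficient Information"


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p.strip() for p in parts if p.strip()]


def context_relevance(extracted, context):
    """Fraction of context sentences that the model extracted verbatim."""
    context_sentences = split_sentences(context)
    if not context_sentences or extracted.strip().strip('"') == INSUFFICIENT:
        return 0.0
    context_set = set(context_sentences)
    # Only count sentences that really appear in the context, since the
    # prompt forbids the model from altering them.
    kept = {s for s in split_sentences(extracted) if s in context_set}
    return len(kept) / len(context_sentences)
```

Requiring an exact match also acts as a check that the model followed the instruction not to modify sentences.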

Answer Relevance Question Generation

33d672d8-e463-4132-b895-1426d1b3c589

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Generate 3 questions for the given answer. Generate the questions in an ordered list:

1.
2.
3.

Answer: {answer}

main

data/100_questions/generated_answers.parquet

N/A

data/100_questions/answer_relevance_questions.parquet

error: An exception occurred indexing, getting dataframe and running evaluation: %Req.TransportError{reason: :closed} | 100 rows | 14131 tokens | $0.0050 | 2 iterations

Statement Faithfulness To Context

a2a8d567-ab5f-42da-9578-0ea3f2daecf9

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Consider the given context and following statements, then determine whether they are supported by the information present in the context. Provide a brief explanation for each statement before arriving at the final verdict (Yes/No). Provide a final verdict for each statement in order at the end in the given format. Do not deviate from the specified format.

Context:
{context}

Statements:
{faithfulness_statements}

main

data/100_questions/faithfulness_statements.parquet

main

data/100_questions/faithfulness_statements.parquet

completed | 100 rows | 38565 tokens | $0.0166 | 3 iterations
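The prompt above asks for a Yes/No verdict per statement at the end of the response, but the exact output format is not shown in this listing. A sketch of scoring such a response, assuming the verdicts all appear on the last non-empty line (a hypothetical format) and that faithfulness is the share of statements judged "Yes":

```python
import re


def parse_verdicts(response):
    """Return one boolean per Yes/No found on the last non-empty line."""
    lines = [line for line in response.strip().splitlines() if line.strip()]
    if not lines:
        return []
    tokens = re.findall(r"\b(yes|no)\b", lines[-1], flags=re.IGNORECASE)
    return [t.lower() == "yes" for t in tokens]


def faithfulness_score(verdicts):
    """Share of supported statements, or None when no verdicts were found."""
    if not verdicts:
        return None
    return sum(verdicts) / len(verdicts)
```

A stray "no" in prose on the final line would be miscounted under this assumption, so a stricter format in the prompt would make the parsing more reliable.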

Faithfulness Statements

9246a591-d95b-4781-8eb4-8487c1a7dc63

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Given a question and an answer, create one or more statements from each sentence in the given answer.

The statements should be in an ordered list such as

1. First Statement
2. Second Statement
etc...

question: {question}

answer: {answer}

main

data/100_questions/generated_answers.parquet

main

data/100_questions/faithfulness_statements.parquet

completed | 100 rows | 18912 tokens | $0.0059 | 2 iterations
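The prompt above requests an ordered list, so a downstream step has to split the response back into individual statements. A minimal sketch, assuming each item sits on its own line and starts with a number followed by `.` or `)` (the `parse_ordered_list` helper is hypothetical):

```python
import re

# A number, then "." or ")", then the item text.
ITEM = re.compile(r"^\s*(\d+)[.)]\s+(.*\S)\s*$")


def parse_ordered_list(response):
    """Return the text of each numbered item, ignoring any other lines."""
    items = []
    for line in response.splitlines():
        match = ITEM.match(line)
        if match:
            items.append(match.group(2))
    return items
```

Lines such as an introductory sentence or a trailing "etc..." are skipped because they do not start with a number.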

5235077a-645e-417d-9294-222971555232

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Consider the given context and following statements, then determine whether they are supported by the information present in the context. Provide a brief explanation for each statement before arriving at the final verdict (Yes/No). Provide a final verdict for each statement in order at the end in the given format. Do not deviate from the specified format.

Context:
{section}

Statements:
{statements}

main

data/ragas/faithfulness.parquet

main

data/ragas/faithfulness.parquet

completed | 847 rows | 712485 tokens | $0.1946 | 2 iterations

Generate statements

a0d21249-16f8-4ae5-a485-8590619cba79

OpenAI/GPT 4o mini | text → text

3 months ago

Prompt

Given a question and an answer, create one or more statements from each sentence in the given answer.

The statements should be in an ordered list such as

1. First Statement
2. Second Statement
etc...

question: {question}

answer: {answer}

main

data/questions_answered.parquet

main

data/ragas/faithfulness.parquet

completed | 847 rows | 145873 tokens | $0.0451 | 4 iterations

Answer Questions For Real

d70645d9-7bdd-4ff2-971c-b3b4680446b8

OpenAI/GPT 4o | text → text

4 months ago

Prompt

Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Question:
{question}

Answer:

main

data/answerable_questions.parquet

main

data/questions_answered.parquet

completed | 847 rows | 487127 tokens | $1.45 | 2 iterations

c783abcd-42c7-4cf0-8c6c-1e41680e7427

OpenAI/GPT 4o mini | text → text

4 months ago

Prompt

Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Question:
{question}

Answer:

main

data/answerable_questions.parquet

completed | 5 row sample | 4309 tokens | $0.0007 | 1 iteration
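A 5 row sample like the one above can be used to estimate what a full run would cost before committing to it. A rough sketch of that arithmetic, using only the figures shown in this listing and assuming cost and token usage scale linearly with row count (the helper names are hypothetical, and the displayed sample cost is rounded, so the estimate is coarse):

```python
def extrapolate(sample_value, sample_rows, total_rows):
    """Scale a per-sample figure up to the full dataset, assuming linear growth."""
    return sample_value / sample_rows * total_rows


def cost_per_million_tokens(cost_usd, tokens):
    """Blended rate (input and output tokens combined) in USD per million tokens."""
    return cost_usd / tokens * 1_000_000


# GPT 4o mini sample: 5 rows, 4309 tokens, $0.0007.
# Full dataset size taken from the GPT 4o run on the same file: 847 rows.
estimated_tokens = extrapolate(4309, 5, 847)
estimated_cost = extrapolate(0.0007, 5, 847)

# Blended rate observed for the completed GPT 4o run: $1.45 for 487127 tokens.
gpt4o_rate = cost_per_million_tokens(1.45, 487127)

print(f"estimated tokens: {estimated_tokens:.0f}")
print(f"estimated cost: ${estimated_cost:.4f}")
print(f"GPT 4o blended rate: ${gpt4o_rate:.2f} per 1M tokens")
```

On these figures the GPT 4o mini estimate for all 847 rows comes to roughly $0.12, against the $1.45 reported for the GPT 4o run on the same file.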

Answer questions

9bf059a9-9a44-47f9-99ee-390ed38d7eaa

OpenAI/o1 mini | text → text

4 months ago

Prompt

Answer the following question as succinctly as possible given the context. If the question cannot be answered given the context, respond with not_answerable.

Context:
{section}

Question:
{question}

Answer:

main

data/answerable_questions.parquet