datasets
Organization Account
datasets's Repositories
Displaying Page 1 of 18 (179 total Repositories)

Example Marimo notebooks for getting started with Oxen.ai.

1 mb
7
Updated: 1 day ago
Public
3

The SMS Spam Collection is a set of tagged SMS messages collected for SMS spam research. It contains a single set of 5,574 English SMS messages, each tagged as either ham (legitimate) or spam. The original data can be found here: https://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection

13.9 mb
21
Updated: 3 weeks ago

A starter repository that highlights some key features to help you get started.

15.2 mb
1610
Updated: 1 month ago
Public
1

5.3 mb
32
Updated: 1 month ago

This repository is an example of how to generate synthetic fine-tuning data with random personas. The final output is a set of "prompt", "response" pairs for customer support tickets.

1.3 mb
321
Updated: 4 months ago

This repository contains 1 million images collected from different sources for running chain-of-thought reasoning.

147 gb
184K1.2M3
Updated: 5 months ago
Public
2

A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

976 mb
21K27
Updated: 5 months ago

An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models

163.9 mb
524121
Updated: 5 months ago
Public
0

MathVista is a consolidated Mathematical reasoning benchmark within Visual contexts.

1.2 gb
6.1K2
Updated: 5 months ago
Public
0

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 gb
8100K
Updated: 5 months ago