Featured Datasets
View all featured repositories
Featured Collections
Some of the Oxen team's favorite collections.
Visual LLMs
Datasets for image understanding with large language models.
a collection by datasets
LLM-Feedback
Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.
a collection by ox
Multimodal
Datasets that span modalities: combinations of text, image, audio, video, and more.
a collection by ox
Browse all collections