Featured Datasets
Public
32
59.1 gb
1651
based on https://huggingface.co/datasets/Norod78/simpsons-blip-captions
240.3 mb
23.1K4
based on https://huggingface.co/datasets/Norod78/simpsons-blip-captions
240.3 mb
43.1K2
Public
0
4.1 gb
63
0
687 mb
51
View all featured repositories
Featured Collections
Some of the Oxen team's favorite collections.
Visual LLMs
This collection is datasets for understanding of images with large language models
a collection by datasets
LLM-Feedback
Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.
a collection by ox
Multimodal
List of datasets that cross modalities, combinations of text, image, audio, video etc.
a collection by ox
Browse all collections