Collections
LLM-SFT

Interesting datasets for supervised fine-tuning (SFT) of language models.

a collection by ox

LLM-Feedback

Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.

a collection by ox

LLM-Eval

A list of standard benchmarks for LLM evaluation.

a collection by ox

Multimodal

Datasets that span multiple modalities: combinations of text, image, audio, video, etc.

a collection by ox

Global Climate Challenge - Oceans

This collection has no description

a collection by oxbot