LLM-Feedback

Datasets containing human or AI feedback, useful for training reward models or applying preference-optimization techniques such as Direct Preference Optimization (DPO).

41.6 MB
12
Updated: 4 months ago

19.1 MB
12
Updated: 4 months ago

519.5 MB
12
Updated: 4 months ago

183.5 MB
22
Updated: 4 months ago

19.5 MB
22
Updated: 4 months ago

24.2 MB
22
Updated: 4 months ago

26 MB
22
Updated: 4 months ago

321.7 MB
12
Updated: 4 months ago