LLM-Feedback
Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.
41.6 mb
12
Public
0
19.1 mb
12
Public
0
519.5 mb
12
Public
0
19.5 mb
22
Public
0
24.2 mb
22
Public
0
321.7 mb
12
Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.