LLM-SFT
Interesting datasets to supervise fine-tune (SFT) language models with.
Public
3
2.4 gb
62
8.6 mb
46
Public
1
This dataset contains message trees. Each message tree has an initial prompt message as the root node, which can have multiple child messages as replies, and these child messages can have multiple replies. All messages have a role property: this can either be "assistant" or "prompter". The roles in conversation threads from prompt to leaf node strictly alternate between "prompter" and "assistant". This version of the dataset contains data collected on the open-assistant.io website until Nov 5 2023.
22.3 mb
1
Public
0
795.2 mb
12
Public
0
1.2 mb
22