Collections/ox/llm-sft

LLM-SFT

Interesting datasets to supervise fine-tune (SFT) language models with.

2.4 gb
62
Updated: 3 months ago
0

8.6 mb
46
Updated: 2 months ago

This dataset contains message trees. Each message tree has an initial prompt message as the root node, which can have multiple child messages as replies, and these child messages can have multiple replies. All messages have a role property: this can either be "assistant" or "prompter". The roles in conversation threads from prompt to leaf node strictly alternate between "prompter" and "assistant". This version of the dataset contains data collected on the open-assistant.io website until Nov 5 2023.

22.3 mb
1
Updated: 6 months ago

795.2 mb
12
Updated: 3 months ago

1.2 mb
22
Updated: 2 months ago