ox/mosaicml-instruct-v3 | Datasets at Oxen.ai

Home Repositories Models Docs Blog Pricing Login Sign up

Search

mosaicml-instruct-v3

Data Branches Evaluations Fine-tune

mosaicml-instruct-v3

public

233.5 mb

2

mosaicml-instruct-v3

/

About

This is an aggregate dataset, comprised of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD and Spider. The intention was to create a permissively-licensed instruction-following dataset with a large number of longform samples.

2 commits

1 contributor

1 download

233.5 mb

0 stars

Repository contents

2 tabular files

Contributors

Bessie

Copyright © 2026 Oxen Labs, Inc., All Rights Reserved

Careers Privacy Policy Terms and Conditions