datasets
Organization Account
datasets's Repositories

578.3 MB
23K12
Updated: 11 months ago

This dataset contains arrival and departure events for buses through the most recently completed month of 2022. Because of data collection issues, the data is not guaranteed to be complete for any stop or date.

3.8 GB
1
Updated: 1 year ago
21

BabyLM Challenge 2024: sample-efficient pretraining on a developmentally plausible corpus.

418.7 MB
242
Updated: 1 year ago
Public
2

The QA bAbI tasks are a set of proxy tasks that evaluate reading comprehension via question answering.

2.8 MB
1
Updated: 1 year ago

898.7 kB
32
Updated: 1 year ago
Public
2

3.2 GB
1107K7K
Updated: 1 year ago

3.9 GB
11107.1K
Updated: 1 year ago

A dataset of arXiv papers to build on for fine-tuning an LLM.

35.7 GB
122K23K
Updated: 1 year ago
Public
0

13.5 GB
1K19972
Updated: 1 year ago