Md Nurul Huda
nurul-oxen
User account
nurul-oxen's Repositories
Displaying Page 1 of 3 (23 total Repositories)
Public
11

The CIFAR-10 dataset (Canadian Institute for Advanced Research, 10 classes) is a subset of the Tiny Images dataset and consists of 60000 32x32 color images. The images are labelled with one of 10 mutually exclusive classes: airplane, automobile (but not truck or pickup truck), bird, cat, deer, dog, frog, horse, ship, and truck (but not pickup truck). There are 6000 images per class with 5000 training and 1000 testing images per class.

137.4 mb
260K3
Updated: 1 year ago

Sample Images for Stable Diffusion

503 kB
161
Updated: 1 year ago
Public
0

Ox Images for Dreambooth Stable Diffusion

868 kB
116
Updated: 1 year ago

This is the aerial video subset of the VIRAT Video Data collection.

4.9 gb
125
Updated: 1 year ago
Public
0

Part of UCF101 is an action recognition data set of realistic action videos, collected from YouTube, having 101 action categories.

395.7 mb
11.3K5
Updated: 1 year ago

The dataset contains 100K images with English text.

22.5 mb
11
Updated: 1 year ago
Public
0

The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.

882.2 mb
2K11
Updated: 1 year ago
Public
0

AG News (AG’s News Corpus) is a subdataset of AG's corpus of news articles constructed by assembling titles and description fields of articles from the 4 largest classes (“World”, “Sports”, “Business”, “Sci/Tech”) of AG’s Corpus. The AG News contains 30,000 training and 1,900 test samples per class.

31.3 mb
12
Updated: 1 year ago
Public
0

Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable.

128.8 mb
12
Updated: 1 year ago

CoVoST 2 is a large-scale multilingual speech translation corpus covering translations from 21 languages into English and from English into 15 languages. The dataset is created using Mozillas open-source Common Voice database of crowdsourced voice recordings.

1.4 gb
41K13
Updated: 1 year ago