nurul-oxen (Md Nurul Huda) Repositories

nurul-oxen

User Account

Repositories

Md Nurul Huda

nurul-oxen

User account

nurul-oxen's Repositories

CIFAR-10

public

The CIFAR-10 dataset (Canadian Institute for Advanced Research, 10 classes) is a subset of the Tiny Images dataset and consists of 60000 32x32 color images. The images are labelled with one of 10 mutually exclusive classes: airplane, automobile (but not truck or pickup truck), bird, cat, deer, dog, frog, horse, ship, and truck (but not pickup truck). There are 6000 images per class with 5000 training and 1000 testing images per class.

137.4 mb

260K3

Updated: 3 years ago

ShakibAlHasan

public

Sample Images for Stable Diffusion

503 kB

161

Updated: 3 years ago

OxImages

public

Ox Images for Dreambooth Stable Diffusion

868 kB

116

Updated: 3 years ago

VIRAT-Aerial

public

This is the aerial video subset of the VIRAT Video Data collection.

4.9 gb

125

Updated: 3 years ago

UCF-05

public

Part of UCF101 is an action recognition data set of realistic action videos, collected from YouTube, having 101 action categories.

395.7 mb

11.3K5

Updated: 3 years ago

Laion-100K

public

The dataset contains 100K images with English text.

22.5 mb

Updated: 3 years ago

ESC-50

public

The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.

882.2 mb

2K11

Updated: 3 years ago

AG_NEWS

public

AG News (AG’s News Corpus) is a subdataset of AG's corpus of news articles constructed by assembling titles and description fields of articles from the 4 largest classes (“World”, “Sports”, “Business”, “Sci/Tech”) of AG’s Corpus. The AG News contains 30,000 training and 1,900 test samples per class.

31.3 mb

Updated: 3 years ago

SQuAD

public

Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable.

128.8 mb

Updated: 3 years ago

CoVoST-Delta-Segment-12

public

CoVoST 2 is a large-scale multilingual speech translation corpus covering translations from 21 languages into English and from English into 15 languages. The dataset is created using Mozillas open-source Common Voice database of crowdsourced voice recordings.

1.4 gb

41K13

Updated: 3 years ago