Md Nurul Huda
nurul-oxen
User account
nurul-oxen's Repositories
Displaying Page 2 of 3 (23 total Repositories)

Optical Recognition of Handwritten Digits Data Set

45.9 mb
70K12
Updated: 1 year ago

50 speakers audio data with length more than 1 hour for each. Further, data converted to wav format, 16KHz, mono channel and is split into 1min chunks. This dataset can be used for speaker recognition kind of problems. This dataset was scraped from YouTube and Librivox.

5 gb
2.5K11
Updated: 1 year ago

Loop sensor data was collected for the Glendale on ramp for the 101 North freeway in Los Angeles

1.4 mb
21
Updated: 1 year ago

This is a small excerpt of the [Speech Commands Dataset].

156.9 mb
1124.9K
Updated: 2 years ago

An audio dataset of spoken words designed to help train and evaluate keyword spotting systems. Its primary goal is to provide a way to build and test small models that detect when a single word is spoken, from a set of ten target words, with as few false positives as possible from background noise or unrelated speech.

252.5 mb
8K81
Updated: 2 years ago
Public
0

The CIFAR-100 dataset consists of 60000 32x32 colour images in 100 classes, with 600 images per class. There are 500 training images and 100 testing images per class. There are 50000 training images and 10000 test images. The 100 classes are grouped into 20 superclasses. There are two labels per image - fine label (actual class) and coarse label (superclass).

149.8 mb
160K2
Updated: 2 years ago

Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazon’s iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

1.6 gb
12
Updated: 2 years ago

The Linnerud dataset is a multi-output regression dataset. It consists of three exercise (data) and three physiological (target) variables collected from twenty middle-aged men in a fitness club:

1.1 kB
11
Updated: 2 years ago

This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective is to predict based on diagnostic measurements whether a patient has diabetes.

89.9 kB
11
Updated: 2 years ago

The Boston Housing Dataset. A Dataset derived from information collected by the U.S. Census Service concerning housing in the area of Boston Mass.

41 kB
11
Updated: 2 years ago