nurul-oxen (Md Nurul Huda) Repositories

Md Nurul Huda

nurul-oxen

User account

nurul-oxen's Repositories

Displaying Page 2 of 3 (23 total Repositories)

DigitDataset

Public

Optical Recognition of Handwritten Digits Data Set

45.9 mb

70K12

Updated: 2 years ago

50 speakers audio data with length more than 1 hour for each. Further, data converted to wav format, 16KHz, mono channel and is split into 1min chunks. This dataset can be used for speaker recognition kind of problems. This dataset was scraped from YouTube and Librivox.

5 gb

2.5K11

Updated: 2 years ago

DodgersLoopSensorData

Public

Loop sensor data was collected for the Glendale on ramp for the 101 North freeway in Los Angeles

1.4 mb

Updated: 2 years ago

SpeechCommand

Public

This is a small excerpt of the [Speech Commands Dataset].

156.9 mb

1124.9K

Updated: 2 years ago

MiniSpeechCommands

Public

An audio dataset of spoken words designed to help train and evaluate keyword spotting systems. Its primary goal is to provide a way to build and test small models that detect when a single word is spoken, from a set of ten target words, with as few false positives as possible from background noise or unrelated speech.

252.5 mb

8K81

Updated: 2 years ago

CIFAR-100

Public

The CIFAR-100 dataset consists of 60000 32x32 colour images in 100 classes, with 600 images per class. There are 500 training images and 100 testing images per class. There are 50000 training images and 10000 test images. The 100 classes are grouped into 20 superclasses. There are two labels per image - fine label (actual class) and coarse label (superclass).

149.8 mb

160K2

Updated: 2 years ago

AMAZONREVIEWFULL

Public

Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazon’s iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

1.6 gb

Updated: 2 years ago

LinnerrudDataset

Public

The Linnerud dataset is a multi-output regression dataset. It consists of three exercise (data) and three physiological (target) variables collected from twenty middle-aged men in a fitness club:

1.1 kB

Updated: 2 years ago

DiabetesDataset

Public

This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective is to predict based on diagnostic measurements whether a patient has diabetes.

89.9 kB

Updated: 2 years ago

BostonHousing

Public

The Boston Housing Dataset. A Dataset derived from information collected by the U.S. Census Service concerning housing in the area of Boston Mass.

41 kB

Updated: 2 years ago