Bessie
ox's Repositories
FUNSD
PublicA dataset for Text Detection, Optical Character Recognition, Spatial Layout Analysis and Form Understanding.
CelebA
PublicCelebFaces Attributes Dataset (CelebA) is a large-scale face attributes dataset with more than 200K celebrity images, each with 40 attribute annotations. The images in this dataset cover large pose variations and background clutter. CelebA has large diversities, large quantities, and rich annotations.
The Crowd Instance-level Human Parsing (CIHP) dataset has 38,280 diverse human images. Each image in CIHP is labeled with pixel-wise annotations for 20 categories, as well as instance-level identification. This dataset can be used for the "human part segmentation" task.
arXiv-paper-abstracts
Publicarxiv paper abstracts intended to be used for multi-label text classification.
Dataset of IMDB movie reviews with 100k training/test examples