Categories/Computer Vision
Computer Vision Datasets

Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using digital images from cameras and videos and deep learning models, machines can accurately identify and classify objects — and then react to what they “see.”

Displaying Page 2 of 5 (42 total Repositories)
mnist
Public
Empty Repository

Updated: 10 months ago

Flickr8k
Public
Updated: 7 months ago

A benchmark collection for sentence-based image description and search, consisting of 8,000 images that are each paired with five different captions which provide clear descriptions of the salient entities and events. … The images were chosen from six different Flickr groups, and tend not to contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations.

Updated: 2 weeks ago

56.6 mb
28331
UCF101
Public
Updated: 10 months ago

Updated: 3 weeks ago

56.6 mb
21833
Updated: 3 weeks ago

mnist
Public
Updated: 1 month ago

Flowers
Public
Updated: 6 months ago

An image classification dataset containing 3670 images of flowers across 5 classes: daisy, dandelion, roses, sunflowers, tulips. The images are of nonstandard sizes and aspect ratios, ranging from 500 x 442 px to 143 x 240 px.