Computer Vision Datasets
Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using deep learning models trained on digital images from cameras and video, machines can identify and classify objects, and then react to what they “see.”
A benchmark collection for sentence-based image description and search, consisting of 8,000 images, each paired with five different captions that clearly describe the salient entities and events. … The images were chosen from six different Flickr groups and tend not to contain any well-known people or locations; they were manually selected to depict a variety of scenes and situations.