Categories/Computer Vision/Image Captioning
Image Captioning Datasets

Image captioning is a computer vision task that generates a textual description of an image. This is an important task in computer vision that is used in many applications, such as search and Generative AI.

Displaying Page 1 of 1 (2 total Repositories)
Flickr8k
Public
Updated: 7 months ago

A benchmark collection for sentence-based image description and search, consisting of 8,000 images that are each paired with five different captions which provide clear descriptions of the salient entities and events. … The images were chosen from six different Flickr groups, and tend not to contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations.