Categories/Computer Vision/Image Captioning
Image Captioning Datasets

Image captioning is a computer vision task that generates a textual description of an image. This is an important task in computer vision that is used in many applications, such as search and Generative AI.

Displaying Page 1 of 1 (5 total Repositories)

A benchmark collection for sentence-based image description and search, consisting of 8,000 images that are each paired with five different captions which provide clear descriptions of the salient entities and events. … The images were chosen from six different Flickr groups, and tend not to contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations.

1.1 gb
938.1K
Updated: 1 year ago

30.2 mb
22
Updated: 5 months ago