This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 697 tags of 697 total tags for 514 datasets (1.36) »
|484||Flickr30k Entities||We propose to use the visual denotations of linguistic expressions (i.e. the set of images they describe) to define novel denotational similarity metrics, which...||phrase grounding caption text analysis image description flickr association video link||link||2019-01-23||348|
|397||MPI-I VISPR (Visual Privacy)||We present a dataset to address the problem of visual privacy - where users unintentionally leak private information when sharing personal images online, such a...||privacy multilabel classification flickr scene regression||link||2018-04-13||647|
|294||Happy People Images Database||Group emotion recognition in images - Happiness Intensity labels for group of people in images. The images have been collected from Flickr using keyword search ...||group, facial expression, emotion, wild, human, flickr, behavior||link||2019-12-19||1290|
|280||Yahoo Flickr Creative Commons 100M||Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ...||flickr landmark image recognition detection reconstruction 3d clustering social community internet||link||2015-09-24||1897|
|200||Landmark 3D||This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope it coul...||landmark recognition classification retrieval 3d reconstruction codebook matching feature flickr||link||2016-08-09||1628|
|152||Colosseum and San Marco||The Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets are ...||3d, reconstruction, landmark, urban, sfm, aerial, street, flickr||link||2017-11-28||2025|
|147||FlickrLogos-32||The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods...||flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox||link||2019-11-12||2553|
|63||Paris500k||The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box rather ...||retrieval, paris, landmark, geotag, flickr, panoramio, sfm, reconstruction||link||2019-05-22||1860|
|54||Notre Dame||The Notre Dame de Paris dataset used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame by Mic...||limited, flickr, landmark, sfm, paris, frontview, reconstruction, 3d, pointcloud||link||2015-06-19||1705|