This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also has a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, the site is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Showing 665 of 665 total tags for 470 datasets (1.41).
|388||Open Images Dataset v4||Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ...||classification large-scale category real image deep annotation automatic benchmark boundingbox||link||2018-09-11||519|
|272||Stanford 40 Actions||The Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performing the ac...||human action recognition detection boundingbox||link||2015-06-19||1260|
|147||FlickrLogos-32||The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods...||flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox||link||2018-03-08||1563|
|111||Grabcut||To evaluate our method we designed a new ground truth database of 50 images. The following zip-files contain: Data, Segmentation, Labelling - Lasso, Labelling -...||segmentation, boundingbox, color, optimization, background||link||2015-06-19||896|
|28||CMU Faces - Frontal faces||The MIT + CMU frontal face dataset from H. Rowley contains 130 images with 507 labeled frontal faces from movie, portrait and media sources. It is mostly graysc...||frontview, face, detection object boundingbox||link||2015-06-19||1085|
|14||INRIA People||The INRIA People dataset from Navneet Dalal and Bill Triggs [DalalCVPR2005] consists of training and testing data. The training contains 1805 images and X peopl...||detection, pedestrian, sideview, frontview, human, boundingbox||link||2015-06-19||1679|
|13||CBCL / MIT Pedestrian||MIT Pedestrian dataset from Papageorgiou and Poggio [IJCV2000] contains 509 training and 200 test images of pedestrians in city scenes (plus left-right reflecti...||pedestrian, frontview, detection, urban, people, boundingbox||link||2015-06-19||1243|