This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject to copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
«showing 665 tags of 665 total tags for 470 datasets (1.41) »
|326||Desk3D (Cambridge University)||Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., & Cipo...||depth instance pose detection||link||2016-04-15||842|
|314||WIDER FACE: A Face Detection Benchmark||The WIDER FACE dataset is a large-scale face detection benchmark with 32,203 images and 393,703 face annotations, which have a high degree of variability in...||face detection scale pose occlusion||link||2016-02-11||1430|
|307||HandNet annotated hand dataset||The HandNet dataset contains depth images of 10 participants' hands non-rigidly deforming in front of a RealSense RGB-D camera. This dataset includes 214971 a...||hand articulation segmentation classification detection pose fingertip rgbd video||link||2017-09-12||1382|
|221||EPFL Multi-View Cars||The EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using the time o...||pose multiview car detection estimation rotation||link||2014-02-10||1230|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players, taken from 3 views at 257 time instances, with 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2018-06-28||1874|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||2903|
|29||The Yale Face||The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to...||face pedestrian detection pose illumination||link||2015-06-23||1048|
|25||PASCAL VOCs||The PASCAL VOC Challenge by Mark Everingham is a yearly dataset with a central evaluation server; the final test data is not released. The late...||detection segmentation pose pedestrian chair animal car building airplane||link||2017-03-09||1268|