This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 562 tags of 562 total tags for 409 datasets (1.37) »
|384||An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects (T-LESS)||A dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant objects:...||RGBD 3D pose texture-less object estimation||link||2017-09-12||92|
|333||UBC3V Dataset||UBC3V is a synthetic dataset for training and evaluation of single or multiview depth-based pose estimation techniques. The nature of the data is similar to the...||depth segmentation pose||link||2016-08-18||417|
|326||Desk3D (Cambridge University)||Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., & Cipo...||depth instance pose detection||link||2016-04-15||501|
|314||WIDER FACE: A Face Detection Benchmark||WIDER FACE dataset is a large-scale face detection benchmark dataset with 32,203 images and 393,703 face annotations, which have high degree of variabilities in...||face detection scale pose occlusion||link||2016-02-11||908|
|307||HandNet annotated hand dataset||The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a...||hand articulation segmentation classification detection pose fingertip rgbd video||link||2017-09-12||849|
|299||CAMP-TUM: Multiple Human Pose Estimation from Multiple Views||We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from CVLAB@...||3D human pose estimation multiple view motion capture||link||2015-07-15||576|
|261||MPI Multi-View Collection GVV datasets||Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of the G...||video multiview tracking face mesh reconstruction depth human action pose||link||2014-12-10||695|
|260||Eurasian Cities dataset||The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing points ...||vanishing line point geometry pose urban reconstruction outdoor manhattan||link||2016-11-29||874|
|246||Bristol Egocentric Object Interactions Dataset||The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is recorded...||video interaction object egocentric pose 3d tracking||link||2017-09-12||969|
|221||EPFL Multi-View Cars||Th EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using the time o...||pose multiview car detection estimation rotation||link||2014-02-10||978|
|208||Landmark 1000||The Landmark 1000 or 1k dataset is a collection of the top 1000 popular flickr landmarks mined from flickr. It is maintained by Noah Snavely and published in...||landmark 3d reconstruction pose estimation pointcloud world location||link||2013-11-05||977|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2016-09-18||1302|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||1985|
|117||YorkUrbanDB||The York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the campus of...||vanishing, point, pose, urban, reconstruction, outdoor, geometry, manhattan||link||2013-09-18||681|
|29||The Yale Face||The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to...||face, pedestrian, detection, pose, illumination||link||2015-06-23||761|
|27||Idiap/ETHZ Faces and Poses||Idiap/ETHZ Faces and Poses Dataset dataset by L. Jie, B. Caputo and V. Ferrari contains 1703 image-caption pairs. [author] Captions contain the names of some of...||face, pose, pedestrian, text||link||2013-03-11||767|
|26||We Are Family Stickmen||The We Are Family Stickmen dataset from Eichner and Ferrari contains X images with X people in group photos for human pose estimation with annotated 2D human bo...||pose, pedestrian, body part||link||2013-03-11||795|
|25||PASCAL VOCs||The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late...||detection segmentation pose pedestrian chair animal car building airplane||link||2017-03-09||971|