This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 684 tags of 684 total tags for 487 datasets (1.4) »
|478||UE4Sim and Sim4CV||Sim4CV is the general environment for simulating data for computer vision tasks, like object tracking, pose estimation, detection, action recognition, indoor sc...||object tracking, pose estimation, detection, action recognition, indoor scene understanding, multi-agent collaboration, autonomous navigation, 3d reconstruction, crowd understanding, urban scene understanding, human tracking, aerial surveying. simulation environment 3d photo-realistic realism depth segmentation urban rgb render||link||2018-11-30||147|
|472||human3.6m||human3.6m dataset is one of the largest datasets for 3D human pose estimation. It consists of 3.6 million images featuring 11 actors performing 15 daily activ...||human pose estimation camera video 3d laser scan action actor body part mocap||link||2018-10-09||143|
|470||MVOR||MVOR is a Multi-view Multi-person RGB-D Operating Room Dataset for 2D and 3D Human Pose Estimation We are pleased to announce the release of the MVOR datase...||medical clinical human annotation multiview pose estimation rgbd operation hospital||link||2018-10-08||98|
|462||Taskonomy||The Taskonomy dataset consists of 3.9 Mil. Scenes, 600 Buildings, 25 Tags per Image, 1024 Resolution for taxonomy and transfer learning tasks. We provide a larg...||transfer learning taxonomy task deep indoor 3d mesh pose camera high-resolution||link||2018-08-08||105|
|384||An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects (T-LESS)||A dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant objects:...||RGBD 3D pose texture-less object estimation||link||2017-09-12||476|
|333||UBC3V Dataset||UBC3V is a synthetic dataset for training and evaluation of single or multiview depth-based pose estimation techniques. The nature of the data is similar to the...||depth segmentation pose||link||2016-08-18||958|
|326||Desk3D (Cambridge University)||Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., & Cipo...||depth instance pose detection||link||2016-04-15||945|
|314||WIDER FACE: A Face Detection Benchmark||WIDER FACE dataset is a large-scale face detection benchmark dataset with 32,203 images and 393,703 face annotations, which have high degree of variabilities in...||face detection scale pose occlusion||link||2016-02-11||1537|
|307||HandNet annotated hand dataset||The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a...||hand articulation segmentation classification detection pose fingertip rgbd video||link||2017-09-12||1579|
|299||CAMP-TUM: Multiple Human Pose Estimation from Multiple Views||We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from CVLAB@...||3D human pose estimation multiple view motion capture||link||2015-07-15||912|
|261||MPI Multi-View Collection GVV datasets||Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of the G...||video multiview tracking face mesh reconstruction depth human action pose||link||2014-12-10||1023|
|260||Eurasian Cities dataset||The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing points ...||vanishing line point geometry pose urban reconstruction outdoor manhattan||link||2018-01-11||1351|
|246||Bristol Egocentric Object Interactions Dataset||The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is recorded...||video interaction object egocentric pose 3d tracking||link||2017-09-12||1437|
|221||EPFL Multi-View Cars||Th EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using the time o...||pose multiview car detection estimation rotation||link||2014-02-10||1326|
|208||Landmark 1000||The Landmark 1000 or 1k dataset is a collection of the top 1000 popular flickr landmarks mined from flickr. It is maintained by Noah Snavely and published in...||landmark 3d reconstruction pose estimation pointcloud world location||link||2013-11-05||1376|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2018-06-28||2070|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||3300|
|117||YorkUrbanDB||The York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the campus of...||vanishing, point, pose, urban, reconstruction, outdoor, geometry, manhattan||link||2013-09-18||964|
|29||The Yale Face||The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to...||face, pedestrian, detection, pose, illumination||link||2015-06-23||1155|
|27||Idiap/ETHZ Faces and Poses||Idiap/ETHZ Faces and Poses Dataset dataset by L. Jie, B. Caputo and V. Ferrari contains 1703 image-caption pairs. [author] Captions contain the names of some of...||face, pose, pedestrian, text||link||2013-03-11||1101|
|26||We Are Family Stickmen||The We Are Family Stickmen dataset from Eichner and Ferrari contains X images with X people in group photos for human pose estimation with annotated 2D human bo...||pose, pedestrian, body part||link||2013-03-11||1158|
|25||PASCAL VOCs||The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late...||detection segmentation pose pedestrian chair animal car building airplane||link||2017-03-09||1398|