This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 529 tags of 529 total tags for 385 datasets (1.37) »
|222||Ford Car Dataset||The Ford Car dataset is joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects). ...||car detection lidar 3d groundtruth sfm||link||2014-04-16||1627|
|186||Symmetry Facades||The Symmetry Facades dataset contains 9 building facades with multiple images. It used for coupled symmetry and structure from motion detection. Coupled Str...||symmetry facade building urban reconstruction sfm 3d repetition||link||2013-09-05||989|
|152||Colosseum and San Marco||The Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets are ...||3d, reconstruction, landmark, urban, sfm, aerial, streetside, flickr||link||2015-05-04||1174|
|135||Quad 6K||The Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth positions o...||reconstruction, sfm, urban, groundtruth, landmark, 3d gps||link||2013-11-05||947|
|131||Dubrovnik6K and Rome16K||The Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dubrovnik6...||reconstruction, sfm, urban, landmark, dubrovnik, rome||link||2017-03-10||883|
|127||Stable Structure from Motion||The Stable Structure from Motion datasets due to size limitations cannot put the images online. Instead here are the tracked image points and the final reconstr...||sfm, reconstruction, geometry, stability, robust, 3d, landmark, church||link||2013-08-08||1101|
|125||Google Street View Pittsburgh Research||The Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided here co...||3d, reconstruction, sfm, urban, pittsburgh, panorama||n/a||2017-05-17||1765|
|123||CMU/VMR Urban Image+Laser||CMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner being turne...||reconstruction, sfm, urban, semantic, segmentation, laser||link||2013-04-02||896|
|122||Symmetric Bundle Adjustment||The Symmetric Bundle Adjustment dataset contains four sequences of the CAB building, Barcelona, Redmond and Capitole for 3D reconstruction considering symmetrie...||reconstruction, sfm, urban, bundle adjustment, symmetry||link||2013-03-12||787|
|121||Oakland 3D||This repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purposes. ...||reconstruction, sfm, urban, semantic, segmentation, laser||link||2014-06-10||865|
|120||Samantha||The SAMANTHA (Structure-and-Motion Pipeline on a Hierarchical Cluster Tree) dataset contains 4 sequences for 3D reconstruction: Pozzoveggiani, Piazza Dante, Pia...||reconstruction, sfm, landmark, model, geometry||link||2013-03-12||1061|
|84||Aachen Retrieval||The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their SIFT ...||retrieval, aachen, landmark, sfm, reconstruction||link||2013-03-11||815|
|83||Ikonos Aerial||Since its launch in September 1999, Space Imaging IKONOS earth imaging satellite has provided a reliable stream of image data that has become the standard for c...||reconstruction, sfm, urban, aerial||link||2013-03-11||835|
|82||Zurich City Hall||Zurich City Hall dataset (also CIPA dataset) nformation: Place: City Hall, Zurich, Switzerland Number of Images: 15, 1280 x 1000 pixels Camera: Fuji DS 30...||reconstruction, sfm, urban, zurich||link||2013-03-11||770|
|74||PMVS 3D Photography||The following are multiview stereo data sets captured in our lab: a set of images, camera parameters and extracted apparent contours of a single rigid object. E...||sfm, reconstruction, depth, dense, mesh||link||2017-01-31||1023|
|73||Strecha Dense MVS||An evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (corrected for...||sfm, reconstruction, benchmark, depth, dense, mesh||link||2014-11-11||1278|
|72||Acute3D Aiguille du Midi MVS||Aiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 2006) ...||sfm, reconstruction, mesh, large scale, outdoor||link||2013-03-21||799|
|68||The KITTI Vision Benchmark Suite||We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste...||stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth||link||2014-02-10||1138|
|67||Middlebury MVS Dino||The object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may wish to...||sfm, reconstruction, benchmark, multiview, 3d,||link||2013-09-20||814|
|66||Middlebury MVS Temple||The object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of ground...||sfm, reconstruction, benchmark, multiview, 3d,||link||2013-09-20||702|
|63||Paris500k||The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box rather ...||retrieval, paris, landmark, geotag, flickr, panoramio, sfm, reconstruction||link||2016-12-23||950|
|54||Notre Dame||The Notre Dame de Paris dataset used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame by Mic...||limited, flickr, landmark, sfm, paris, frontview, reconstruction, 3d, pointcloud||link||2015-06-19||847|
|53||DTU Robot||The DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For each s...||feature, detection, description, matching, sfm, reconstruction, illumination||link||2016-05-15||736|
|49||PhotoTourism Pair||The data is taken from Photo Tourism reconstructions from Trevi Fountain (Rome), Notre Dame (Paris) and Half Dome (Yosemite). Each dataset consists of a series ...||feature, matching, description, pair, sfm||link||2013-03-11||781|
|39||Leuven Stereo Scene||The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper  by Leibe et al. for detection and...||segmentation, semantic, reconstruction, urban, sfm, 3d, leuven, depth, stereo||link||2013-11-03||1526|
|34||CamVid||The Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantically labe...||sfm, depth, semantic, segmentation, urban||link||2016-04-18||2116|