Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, the index is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and images are subject to the copyright of their respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   accuracy   action   activity   actor   address   adhead   adjustment   adult   aerial   aesthetics   affordance   age   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulation   artificial   aspect   asset   atmospheric   attention   attribute   australia   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   bark   baseline   behavior   belgium   benchmark   benchmarking   berlin   bike   bilateral   bim   binary   biology   biometric   biometry   blender   blur   boat   body   bone   bottle   boundingbox   brain   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   california   caltech   camera   canada   caption   captioning   capture   car   cardinal   categorization   category   cats   cbir   celebrity   cell   centered   ceramics   chair   challenge   change   chemistry   chest   chicaco   chromaticity   church   circle   city   cityscapes   classification   clinical   clothing   cloud   clustering   clutter   cnn   co-localization   co-saliency   co-segmentation   co-skeletonization   coco   code   codebook   coffee   collaboration   collaborative   color   community   comparison   computer   condition   constancy   context   contour   cooking   copyright   counting   cover   cow   crepe   crf   crop   cross-view   crowd   ct   cutting   daily   dance   dark   data   dataset   day   daylight   decomposition   deep   defocus   deformation   denoising   dense   depth   description   descriptor   detail   detection   dichromatic   disease   disgust   disparity   dna   dogs   domain   dped   driving   drone   dublin   dubrovnik   duplicate   dynamic   ear   edge   egocentric   ellipse   emotion   empty   endtoend   enhancement   environment   estimation   evaluation   event   exhibit   
expertise   expression   eye   facade   face   facial   fake   family   fashion   fear   feature   field   fine-grained   fingerprint   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   fog   food   foot   footprint   foreground   forensics   fov   frames   frontview   fundus   gait   game   gan   gaze   gender   genetic   genome   geography   geometry   geoscience   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   glassware   global   google   gps   grammar   graphics   grayscale   graz   ground   groundtruth   group   growth   gsd   hand   handwritten   hd   head   heart   heat   hierarchy   high-definition   high-resolution   highlight   highway   holes   horse   hospital   house   howto   human   identification   illumination   illusion   image   imagenet   images   imdb   imu   indigenous   indoor   inertial   initialization   inserts   instance   intake   intensity   interaction   interactive   interest   internet   invariance   ir   isar   iso   joy   kaggle   kernels   keyframe   kimia   kinect   kinship   kitchen   kitti   label   labeling   laboratory   land   landmark   lane   language   large   large-scale   laser   lattice   layout   leaf   learning   letter   leuven   lidar   lifespan   light   lightfield   lighting   limited   line   lip   lisbon   liver   local   localization   location   logo   low-light   lowlevel   machine   makeup   manhattan   map   maritime   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   mocap   model   modeling   monitoring   mono   montage   motion   motorbike   mouse   mouth   movement   movie   mpeg   mser   mug   multi-agent   multi-camera   multi-class   multi-human   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multimedia   multimodal   multiple   multispectral   multitarget   multiview   museum   naming   natural   nature   navigation   
netherlands   network   neutral   newyork   night   nir   noise   normal   nude   number   object   occlusion   ocr   odometry   omnidirection   omnidirectional   online   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   paintings   pair   pairwise   pan   panchromatic   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   people   person   perspective   phase   photo   photo-realistic   photogrammetry   physics   pittsburgh   place   plane   planning   plant   point   pointcloud   polygon   popularity   pornography   pose   potsdam   presentation   pressure   primitive   privacy   procedural   product   profile   project   proposal   pruning   ptz   quality   question   radar   random   rank   ranking   ransac   rate   ratio   re-identification   reading   real   real-world   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   relationship   remote   removal   render   rendering   repetition   resolution   restoration   retina   retinal   retrieval   rgb   rgbd   road   robot   robotic   robust   rome   room   ros   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   sculptures   search   segmentation   selfdriving   semantic   sense   sensing   sequence   series   sfm   shadow   shape   sheffield   shoe   shots   shutter   sideview   sign   similarity   simulation   simultaneous   single   singleview   size   skeleton   skeletonization   sketch   skin   sky   slam   smartphone   soccer   social   software   source   space   spain   spanish   speaker   speech   speed   sphere   sport   stability   stabilization   static   stationary   steganalysis   steganography   stereo   stereovision   stochastic   street   structure   structured   study   stuff   style   stylization   subpixel   subtraction   
summarization   summary   superpixel   superresolution   supervised   supervisely   surface   surgery   surprise   surveillance   surveying.   swan   switzerland   sydney   symmetry   synthetic   table   target   task   taxonomy   temporal   text   textile   texture   texture-less   therapy   thermal   things   time   time-lapse   timelapse   timepieces   tiny   tokyo   tool   top-view   topcoder   tracking   tracklet   traffic   trajectory   transfer   transportation   tree   triangulation   truth   tuberculosis   turbulence   type   uas   uav   udacity   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vegetation   vehicle   velodyne   vessel   video   view   viewpoint   virtual   visible   vision   visual   voc   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wiki   wikipedia   wild   workflow   world   worldwide   xray   year   youtube   zoom   zurich  
«showing 682 tags of 682 total tags for 477 datasets (1.43) »


3d
DID | Name | Description | Tags | URL | Date | Views
478 | UE4Sim and Sim4CV | Sim4CV is the general environment for simulating data for computer vision tasks, like object tracking, pose estimation, detection, action recognition, indoor sc... | object tracking, pose estimation, detection, action recognition, indoor scene understanding, multi-agent collaboration, autonomous navigation, 3d reconstruction, crowd understanding, urban scene understanding, human tracking, aerial surveying. simulation environment 3d photo-realistic realism depth segmentation urban rgb render | link | 2018-11-30 | 34
477 | House3D: A Rich and Realistic 3D Environment | House3D is a virtual 3D environment which consists of thousands of indoor scenes equipped with a diverse set of scene types, layouts and objects sourced from th... | house indoor simulation environment 3d photo-realistic realism depth segmentation urban rgb render | link | 2018-11-30 | 13
473 | 2015 Dublin LiDAR | 2015 Aerial Laser and Photogrammetry Survey of Dublin City Collection Record This record serves as an index to a suite of high density, aerial remote sensing... | laser scan aerial flight urban city dublin pointcloud 3d lidar | link | 2018-10-10 | 41
472 | human3.6m | The human3.6m dataset is one of the largest datasets for 3D human pose estimation. It consists of 3.6 million images featuring 11 actors performing 15 daily activ... | human pose estimation camera video 3d laser scan action actor body part mocap | link | 2018-10-09 | 51
464 | vegetation synthetic real | Data from: Intercomparison of photogrammetry software for three-dimensional vegetation modelling Probst A, Gatziolis D, Strigul N Date Published: June 12,... | vegetation synthetic real 3d model reconstruction software benchmark plant | link | 2018-08-09 | 66
462 | Taskonomy | The Taskonomy dataset consists of 3.9 Mil. Scenes, 600 Buildings, 25 Tags per Image, 1024 Resolution for taxonomy and transfer learning tasks. We provide a larg... | transfer learning taxonomy task deep indoor 3d mesh pose camera high-resolution | link | 2018-08-08 | 62
459 | MVSEC | The Multi Vehicle Stereo Event Camera dataset is a collection of data designed for the development of novel 3D perception algorithms for event based cameras. St... | event camera speed intensity dynamic gps imu 3d benchmark | link | 2018-05-30 | 93
435 | Shadow Detection/Texture Segmentation | The Shadow Detection/Texture Segmentation Computer Vision Dataset is focused around texture analysis, so each image sequence contains shadows moving in front o... | shadow segmentation texture detection artificial 3d virtual noise illumination | link | 2018-11-20 | 175
432 | Collaborative 3D reconstruction with smartphones | collaborative 3d reconstruction with smartphones dataset: Six off-the-shelf Android smartphones captured video streams (Table 1, see below) of three cultural h... | collaborative 3d reconstruction smartphone image cloud video | link | 2018-03-15 | 136
399 | Osnabrück - Synthetic Scalable Cube Dataset | Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and valida... | 3D, Deep Learning, Reconstruction, SfM, Synthetic city urban | link | 2018-02-13 | 395
394 | Matterport 2D-3D-Semantics Data | The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. I... | 3d panorama semantic segmentation depth normal indoor building reconstruction large-scale | link | 2017-07-27 | 539
384 | An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects (T-LESS) | A dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant objects:... | RGBD 3D pose texture-less object estimation | link | 2017-09-12 | 441
377 | Lane Level Localization on a 3D Map | The Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane highway ... | 3d map localization autonomous car driving gps benchmark video road | link | 2017-05-10 | 491
376 | ScanNet | ScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and instance-le... | scene indoor synthetic cad room layout rendering realism 3d segmentation object recognition | link | 2017-05-12 | 469
375 | SUNCG: Indoor Scenes | The SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset of 3D s... | scene indoor synthetic room layout rendering realism 3d segmentation object recognition | link | 2018-10-17 | 609
374 | SceneNet RGB-D Synthetic Indoor | SceneNet RGB-D is a dataset comprised of 5 million Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth. It expands the previous work of Scene... | scene indoor synthetic robot navigation rendering 3d reconstruction trajectory lighting segmentation slam | link | 2017-05-02 | 507
355 | IMPART multi-modal/multi-view | The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The sourc... | multi-view multi-mode video rgbd lidar 3d model color indoor outdoor dynamic action face human emotion | link | 2017-01-01 | 613
351 | CMLA Subpixel Stereo Dataset | A dataset of 66 stereo pairs with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very precise t... | stereo stereovision subpixel groundtruth 3D pointcloud noise depth | link | 2018-03-12 | 659
311 | ASL Datasets Repository | This site is dedicated to providing datasets for the Robotics community with the aim to facilitate result evaluations and comparisons. The datasets presented on t... | laser 3d urban nature city | link | 2015-10-28 | 768
303 | 1DSfM Landmarks | The 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler ground tru... | 3d reconstruction landmark groundtruth benchmark urban city | link | 2018-12-11 | 1023
299 | CAMP-TUM: Multiple Human Pose Estimation from Multiple Views | We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from CVLAB@... | 3D human pose estimation multiple view motion capture | link | 2015-07-15 | 872
287 | INRIA Lafarge Benchmarks | Some datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting Line-ne... | 3d surface reconstruction groundtruth pointcloud object detection line road network urban crowd pedestrian counting | link | 2015-06-18 | 1235
285 | ISPRS-EuroSDR Multi-Platform | ISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMETRY unde... | aerial multiview 3d photogrammetry germany switzerland urban city benchmark reconstruction | link | 2015-06-16 | 913
283 | ISPRS WG III/4 | ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further inform... | aerial multiview 3d photogrammetry germany canada semantic segmentation urban city recognition benchmark | link | 2015-06-16 | 972
282 | ISPRS-EuroSDR HighDensity | ISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well as the... | aerial multiview 3d photogrammetry germany switzerland urban city benchmark reconstruction | link | 2015-06-16 | 886
280 | Yahoo Flickr Creative Commons 100M | Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ... | flickr landmark image recognition detection reconstruction 3d clustering social community internet | link | 2015-09-24 | 1273
271 | Labeling in 3D Scenes | This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: Detecti... | 3d kinect reconstruction indoor depth object recognition | link | 2015-03-16 | 1045
270 | B3DO: Berkeley 3D Object Dataset | For the first few decades of the field's existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with the a... | 3d kinect reconstruction indoor depth object recognition | link | 2015-03-16 | 928
267 | 3DVis | The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, repetitiv... | 3d reconstruction matching registration shape symmetry | link | 2015-01-26 | 841
259 | MOT Challenge 2D and 3D | The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of datase... | 3d tracking multiple target benchmark dataset people pedestrian surveillance video | link | 2015-07-31 | 1586
255 | Robotic 3D Scan Repository | The Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a variety of 3D scans for objects, humans, cities, university campus, heat... | 3d reconstruction scan laser heat urban city human aerial germany bremen lidar osnabrueck | link | 2015-04-10 | 1104
251 | ETHZ CVL RueMonge 2014 | This ETHZ CVL RueMonge 2014 dataset is used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] and p... | semantic segmentation 3d reconstruction architecture paris benchmark source code urban recognition classification outdoor pointcloud mesh | link | 2014-11-24 | 1770
246 | Bristol Egocentric Object Interactions Dataset | The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is recorded... | video interaction object egocentric pose 3d tracking | link | 2017-09-12 | 1385
229 | Paris Rue Madame | The Paris-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone contains ap... | semantic segmentation pointcloud 3d laser classification | link | 2014-06-10 | 919
228 | MPI VehicleScenes | Abstract Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. In thi... | semantic segmentation scene understanding classification 3d car pedestrian | link | 2014-06-10 | 1419
223 | SHOT 3D shape description | The 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some of the m... | 3d shape description benchmark reconstruction registration matching | link | 2015-06-21 | 1208
222 | Ford Car Dataset | The Ford Car dataset is a joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects). ... | car detection lidar 3d groundtruth sfm | link | 2014-04-16 | 2364
220 | 3D Mask Attack Dataset | The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re... | 3d biometry face recognition segmentation frontview emotion | link | 2016-03-14 | 1412
208 | Landmark 1000 | The Landmark 1000 or 1k dataset is a collection of the top 1000 popular flickr landmarks mined from flickr. It is maintained by Noah Snavely and published in... | landmark 3d reconstruction pose estimation pointcloud world location | link | 2013-11-05 | 1335
200 | Landmark 3D | This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope it coul... | landmark recognition classification retrieval 3d reconstruction codebook matching feature flickr | link | 2016-08-09 | 1319
196 | New College Data | The New College Data Set contains 30GB of data intended for use by the mobile robotics and vision research communities. Our anticipated users are parties intere... | odometry urban path 3d reconstruction panorama stereo navigation | link | 2013-09-30 | 1090
195 | Yotta | The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14 km of driving. ... | semantic segmentation urban video camera 3d reconstruction classification | link | 2013-09-30 | 1138
194 | HCI 4D Lightfields | The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. For ma... | 3d 4d lightfield benchmark depth reconstruction evaluation | link | 2017-04-28 | 1653
193 | City planar and non-planar | The city planar and non-planar dataset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (University)... | plane detection 3d urban building estimation | link | 2013-09-23 | 972
189 | Farman Institute 3D Point Sets | The Farman Institute 3D Point Sets dataset contains 11 objects by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman Institu... | 3d laser scanner object reconstruction model point | link | 2013-09-18 | 954
186 | Symmetry Facades | The Symmetry Facades dataset contains 9 building facades with multiple images. It is used for coupled symmetry and structure from motion detection. Coupled Str... | symmetry facade building urban reconstruction sfm 3d repetition | link | 2013-09-05 | 1401
182 | MSR Action | The MSR Action dataset is a collection of various 3D datasets for action recognition. See details http://research.microsoft.com/en-us/um/people/zliu/action... | video action recognition detection reconstruction 3d | link | 2013-09-05 | 1241
181 | All I Have Seen (AIHS) | The All I Have Seen (AIHS) dataset is created to study the properties of total visual input in humans. For around two weeks, Nebojsa Jojic wore a camera capturin... | video summary user study clustering similarity outdoor indoor scene 3d | link | 2018-09-19 | 1029
152 | Colosseum and San Marco | The Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets are ... | 3d, reconstruction, landmark, urban, sfm, aerial, street, flickr | link | 2017-11-28 | 1659
143 | KITTI Odometry | http://www.cvlibs.net/datasets/kitti/eval_odometry.php Related Datasets TUM RGB-D Dataset: Indoor dataset captured with Microsoft Kinect and high-accuracy... | registration, localization, odometry, slam, matching, navigation, urban path 3d reconstruction | link | 2013-09-30 | 1474
140 | RGB-D Person Re-identification | The RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques (such as... | identification, classification, shape, depth, pedestrian, 3d | link | 2014-10-08 | 1302
137 | Synthetic CAD models | The Synthetic CAD Models dataset consists of X synthetic CAD models for detecting (planar) primitives. Efficient RANSAC for Point-Cloud Shape Detection Ruwe... | model, ransac, 3d object, reconstruction, primitive, synthetic | link | 2013-08-08 | 1074
135 | Quad 6K | The Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth positions o... | reconstruction, sfm, urban, groundtruth, landmark, 3d gps | link | 2013-11-05 | 1332
127 | Stable Structure from Motion | Due to size limitations, the images of the Stable Structure from Motion datasets cannot be put online. Instead, here are the tracked image points and the final reconstr... | sfm, reconstruction, geometry, stability, robust, 3d, landmark, church | link | 2013-08-08 | 1661
126 | ISPRS Urban Classification | ISPRS Test Project on Urban Classification and 3D Building Reconstruction The ISPRS working group III/4 announces the release of the 2D semantic labeling ben... | 3d, reconstruction, building, urban, city, semantic, classification, recognition | link | 2014-11-24 | 1023
125 | Google Street View Pittsburgh Research | The Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided here co... | 3d, reconstruction, sfm, urban, pittsburgh, panorama | link | 2018-02-15 | 2572
121 | Oakland 3D | This repository contains labeled 3-D point cloud laser data collected from a moving platform in an urban environment. Data are provided for research purposes. ... | reconstruction, sfm, urban, semantic, segmentation, laser lidar 3d city | link | 2018-10-10 | 1279
112 | SHREC | Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of evalua... | segmentation, mesh, part, 3d | link | 2013-07-29 | 859
67 | Middlebury MVS Dino | The object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may wish to... | sfm, reconstruction, benchmark, multiview, 3d | link | 2013-09-20 | 1178
66 | Middlebury MVS Temple | The object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of ground... | sfm, reconstruction, benchmark, multiview, 3d | link | 2013-09-20 | 1077
54 | Notre Dame | The Notre Dame de Paris dataset is used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also versions for NotreDame by Mic... | limited, flickr, landmark, sfm, paris, frontview, reconstruction, 3d, pointcloud | link | 2015-06-19 | 1264
39 | Leuven Stereo Scene | The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detection and... | segmentation, semantic, reconstruction, urban, sfm, 3d, leuven, depth, stereo | link | 2018-06-28 | 2303


total views: 61506