Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   accuracy   action   activity   address   adhead   adjustment   adult   aerial   aesthetics   affordance   age   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulation   artificial   aspect   atmospheric   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   berlin   bike   bilateral   bim   binary   biology   biometric   biometry   blender   blur   boat   body   bone   bottle   boundingbox   brain   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   california   caltech   camera   canada   caption   captioning   capture   car   cardinal   categorization   category   cats   cbir   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chicaco   chromaticity   church   circle   city   cityscapes   classification   clothing   cloud   clustering   clutter   cnn   co-localization   co-saliency   co-segmentation   co-skeletonization   coco   code   codebook   coffee   collaborative   color   community   comparison   computer   condition   constancy   context   contour   cooking   copyright   counting   cover   cow   crepe   crf   crop   cross-view   crowd   ct   cutting   daily   dance   dark   data   dataset   day   daylight   decomposition   deep   defocus   deformation   denoising   dense   depth   description   descriptor   detail   detection   dichromatic   disease   disgust   disparity   dogs   domain   dped   driving   drone   dubrovnik   duplicate   dynamic   ear   edge   egocentric   ellipse   emotion   empty   endtoend   enhancement   environment   estimation   evaluation   event   expertise   expression   eye   facade   face   facial   fake   fashion   fear   feature   field   fine-grained   fingerprint   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   fog   food   foot   footprint   foreground   fov   frames   frontview   fundus   gait   game   gan   gaze   gender   genetic   genome   geography   geometry   geoscience   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   grayscale   graz   ground   groundtruth   group   growth   gsd   hand   handwritten   hd   head   heart   heat   hierarchy   high-definition   high-resolution   highlight   highway   holes   horse   house   howto   human   identification   illumination   illuminiation   illusion   image   imagenet   images   imdb   imu   indoor   inertial   initialization   inserts   instance   intake   intensity   interaction   interactive   interest   internet   invariance   ir   isar   iso   joy   kaggle   kernels   keyframe   kimia   kinect   kitchen   kitti   label   labeling   laboratory   land   landmark   lane   language   large   large-scale   laser   lattice   layout   leaf   learning   letter   leuven   lidar   lifespan   light   lightfield   lighting   limited   line   lip   lisbon   liver   local   localization   location   logo   low   lowlevel   machine   makeup   manhattan   map   maritime   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   monitoring   mono   montage   motion   motorbike   mouse   mouth   movement   movie   mpeg   mser   mug   multi-camera   multi-class   multi-human   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multimedia   multimodal   multiple   multispectral   multitarget   multiview   naming   natural   nature   navigation   netherlands   network   neutral   newyork   night   nir   noise   normal   nude   number   object   occlusion   ocr   odometry   omnidirection   omnidirectional   online   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   pan   panchromatic   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   pedestrians   people   person   perspective   phase   photo   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   polygon   popularity   pornography   pose   potsdam   presentation   pressure   primitive   privacy   procedural   profile   project   proposal   pruning   ptz   quality   question   radar   random   rank   ranking   ransac   rate   ratio   re-identification   reading   real   real-world   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   remote   removal   rendering   repetition   resolution   restoration   retina   retinal   retrieval   rgb   rgbd   road   robot   robotic   robust   rome   room   ros   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   search   segmentation   selfdriving   semantic   sense   sensing   sequence   series   sfm   shadow   shape   sheffield   shoes   shots   shutter   sideview   sign   signs   similarity   simultaneous   single   singleview   size   skeleton   skeletonization   sketch   skin   sky   slam   smartphone   soccer   social   software   source   space   spain   spanish   speaker   speech   speed   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   structure   structured   study   stuff   style   stylization   subpixel   subtraction   summarization   summary   superpixel   superresolution   supervised   supervisely   surface   surgery   surprise   surveillance   swan   switzerland   sydney   symmetry   synthetic   table   target   taxonomy   temporal   text   textile   texture   texture-less   therapy   thermal   things   time   timelapse   tiny   tokyo   tool   tools   top-view   topcoder   tracking   tracklet   traffic   trajectory   transfer   transportation   trees   triangulation   truth   tuberculosis   turbulence   type   uas   uav   udacity   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   vessel   video   view   viewpoint   virtual   visible   vision   visual   voc   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wiki   wikipedia   wild   workflow   world   worldwide   xray   year   youtube   zoom   zurich  
«showing 653 tags of 653 total tags for 463 datasets (1.41) »


benchmark
DID Name Description Tags URL Date Views
459 MVSEC The Multi Vehicle Stereo Event Camera dataset is a collection of data designed for the development of novel 3D perception algorithms for event based cameras. St... event camera speed intensity dynamic gps imu 3d benchmark link 2018-05-30 51
455 Darmstadt Noise Dataset The Darmstadt Noise dataset provides a benchmark for denoising performance. Lacking realistic ground truth data, image denoising techniques are traditionally e... noise denoising benchmark high-resolution groundtruth iso natural real link 2018-04-18 96
454 SBM-RGBD Dataset The SBM-RGBD dataset [provides] all facilities (data, ground truths, and evaluation scripts) in order to evaluate and compare scene background modelling metho... background modeling rgbd kinect video color depth benchmark indoor surveillance link 2018-04-18 114
449 HOWTO Create Dataset Our catalog of challenges for CV algorithms creates a basis for referencing criticalities by name and allows the calculation of criticality coverage. It is a si... benchmark dataset howto link 2018-04-11 75
448 EPIC-KITCHENS EPIC-KITCHENS, is the largest egocentric video benchmark recorded by 32 participants in their native kitchen environments. Our videos depict non-scripted daily ... action egocentric video benchmark kitchen cooking food activity daily worldwide link 2018-04-11 83
447 WIKI List A list of machine learning datasets ... benchmark dataset wiki aerial machine learning link 2018-04-19 96
446 DAVIS: Densely Annotated VIdeo Segmentation 2016 Dataset released with the CVPR 2016 paper. The videos contain several types of objects and humans with a high quality segmentation annotation. In each video seq... object tracking segmentation video benchmark code hd quality resolution link 2018-04-05 101
443 ApolloScape Semantic Segmentation The ApolloScape Parsing dataset is provided by Baidu for the CVPR 2018 Workshop on Autonomous Driving Challenge. It is expected that the Scene Parsing dataset ... segmentation semantic scene benchmark size urban autonomous driving camera calibration link 2018-04-25 186
442 YouTube Co-localization Dataset (ECCV + IEEE Trans. CSVT papers) [GEU and NTU] The dataset consists of bounding box annotations for 15k frames of videos collected from YouTube Objects Dataset. If you find this dataset useful, kindly ci... Co-localization Co-segmentation Co-saliency Video CATS Tracklet Benchmark Binary Object Retrieval Segmentation Semantic Similarity Tracking Matching Localization link 2018-03-21 167
422 Air-Ground-KITTI HD Maps Air-Ground-KITTI dataset consist of annotated aerial and ground images used in the experiments is provided at downloads. Examples of the dataset. Left: aeria... segmentation benchmark aerial urban satellite street road kitti hd link 2018-04-30 330
396 ADE20k Scene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset could be download from MIT Scene Parsing Benchmark. mages ... segmentation semantic annotation benchmark scene recognition link 2017-08-03 332
389 action recognition benchmark We wanted to have a collection of action recognition papers and results that everybody can use for reference. The site will work by the community principle, so ... action recognition benchmark dataset link 2017-07-11 295
377 Lane Level Localization on a 3D Map The Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane highway ... 3d map localization autonomous car driving gps benchmark video road link 2017-05-10 433
373 DAVIS: Densely Annotated VIdeo Segmentation 2017 We present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of other succ... object tracking segmentation video benchmark code hd quality resolution link 2018-04-05 378
353 COCO-Stuff COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks like sema... semantic segmentation stuff things COCO captioning annotation groundtruth benchmark link 2017-02-16 821
336 Procedural texture perceptual similarity The procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a perceptual stud... texture procedural benchmark study link 2016-09-21 451
316 Extreme Classification Repository The Extreme Classification Repository: Multi-label Datasets & Code Kush Bhatia Kunal Dahiya Himanshu Jain Yashoteja Prabhu Manik Varma The objecti... machine learning multilabel classification benchmark evaluation link 2018-03-19 1045
306 Shadow Removal Dataset and Online Benchmark for Variable Scene Categories (University of Bath, Bath) To encourage the open comparison of single image shadow removal in community, we provide an online benchmark site and a dataset. Our quantitatively verified hig... shadow removal benchmark illumination singleview link 2017-12-02 1080
303 1DSfM Landmarks The 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler ground tru... 3d reconstruction landmark groundtruth benchmark urban city link 2015-08-05 958
298 Freiburg-Berkeley Motion Segmentation The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is anno... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 1226
297 Berkeley Video Segmentation The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test... video segmentation benchmark link 2015-07-14 934
296 Video Segmentation Benchmark The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divided into... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 1363
289 ETHZ CVL Clust MICCAI 2015 Challenge on Liver Ultrasound Tracking Munich, October 9, 2015 (Full Day) Outline Ultrasound (US) imaging is a widely used medical imaging techn... medical liver tracking ultrasound therapy human organ benchmark real link 2015-06-19 745
286 HDA Person Dataset - ISR Lisbon The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De... Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human link 2017-10-02 2304
285 ISPRS-EuroSDR Multi-Platform ISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMETRY unde... aerial multiview 3d photogrammetry germany switzerland urban city benchmark reconstruction link 2015-06-16 835
283 ISPRS WG III/4 ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further inform... aerial multiview 3d photogrammetry germany canada semantic segmentation urban city recognition benchmark link 2015-06-16 890
282 ISPRS-EuroSDR HighDensity ISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well as the... aerial multiview 3d photogrammetry germany switzerland urban city benchmark reconstruction link 2015-06-16 818
273 SBMI 2015 Scene Background Initialization (SBI) dataset The SBI dataset has been assembled in order to evaluate and compare the results of background initialization al... change detection background initialization foreground benchmark link 2015-05-02 719
259 MOT Challenge 2D and 3D The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of datase... 3d tracking multiple target benchmark dataset people pedestrian surveillance video link 2015-07-31 1520
251 ETHZ CVL RueMonge 2014 This ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] and p... semantic segmentation 3d reconstruction architecture paris benchmark source code urban recognition classification outdoor pointcloud mesh link 2014-11-24 1661
248 VIDEO datasets overview Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy spread... video benchmark recognition classification detection object action link 2018-04-23 1345
245 ETHZ CVL Video SumMe The Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of videos, anno... video summary benchmark human groundtruth action event link 2016-10-21 2129
240 Microsoft COCO The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features: Mo... object context segmentation detection recognition benchmark semantic link 2015-05-02 1688
233 PASCAL Context We would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. In the cu... semantic segmentation pascal benchmark category recognition dense shape link 2014-07-17 1145
230 FGVC-Aircraft Fine-Grained Visual Classification of Aircraft (FGVC-Aircraft) is a benchmark dataset for the fine grained visual categorization of aircraft. Data, annotatio... fine-grained classification recognition benchmark evaluation aircraft airplane link 2018-06-07 1935
223 SHOT 3D shape description The 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some of the m... 3d shape description benchmark reconstruction registration matching link 2015-06-21 1164
213 ChairGest Gestures ChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 Xsens Ine... benchmark recognition kinect gesture detection human link 2014-06-06 858
194 HCI 4D Lightfields The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. For ma... 3d 4d lightfield benchmark depth reconstruction evaluation link 2017-04-28 1564
177 SIPI textures The Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenthesis (i... texture, segmentation, classification, benchmark, synthetic, evaluation link 2013-08-20 1197
176 Brodatz Album The Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brodatz32.h... texture, segmentation, classification, benchmark, synthetic link 2014-12-23 1443
175 Outex texture bench The Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being constructed acc... texture, segmentation, classification, benchmark, synthetic link 2015-11-17 901
73 Strecha Dense MVS An evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (corrected for... sfm, reconstruction, benchmark, depth, dense, mesh link 2017-12-11 2140
67 Middlebury MVS Dino The object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may wish to... sfm, reconstruction, benchmark, multiview, 3d, link 2013-09-20 1110
66 Middlebury MVS Temple The object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of ground... sfm, reconstruction, benchmark, multiview, 3d, link 2013-09-20 999
55 Prague Texture Segmentation The Prague Texture Segmentation Datagenerator and Benchmark is designed to mutually compare and rank different (dynamic/static) texture segmenters (supervised o... texture, segmentation, classification, benchmark, synthetic link 2013-08-08 920
52 Graffiti The Graffiti dataset by Krystian Mikolajczyk and Cordelia Schmid contains 48 images split into 8 sequences with 6 images each showing different structured and t... feature, detection, description, rectification, benchmark link 2017-02-23 1012


total views: 41657 5 queries in 9.9897384643555E-5s 0.00012898445129395s 0.0001678466796875s 1.9073486328125E-5s 0.0011439323425293s and total 0.0068428516387939s