Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?




562 tags in total for 411 datasets (ratio 1.37)


retrieval
| DID | Name | Description | Tags | URL | Date | Views |
|---|---|---|---|---|---|---|
| 393 | ZuBuD+ | ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previous datase... | landmark, building, image retrieval, urban | link | 2017-07-17 | 78 |
| 354 | Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle | FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara... | Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval | link | 2017-02-27 | 445 |
| 321 | Webcam Interestingness | The Webcam Interestingness dataset consists of 20 different webcam streams, with 159 images each. It is annotated with interestingness ground truth, acquired in... | webcam interest classification retrieval ranking video weather | link | 2016-03-02 | 528 |
| 320 | San Francisco Landmark Dataset for Mobile Landmark Recognition | The San Francisco Landmark Dataset for Mobile Landmark Recognition is a set of images and query images for localization. We present the San Francisco Landmar... | retrieval localization city urban sanfrancisco landmark calibration gps mobile | link | 2016-03-04 | 599 |
| 319 | Visual Search Patches | The Compact Descriptors for Visual Search Patches Dataset (CDVS) is a dataset comprised of pairwise image patches. MPEG is a standard titled Compact Descriptor... | patch matching retrieval descriptor feature mpeg | link | 2016-02-11 | 461 |
| 293 | Google Street View Localization | The Google Street View dataset contains 62,058 high-quality Google Street View images. The images cover the downtown and neighboring areas of Pittsburgh, PA; Or... | localization retrieval gps google streetview urban panorama pittsburgh address manhattan sphere | link | 2015-06-24 | 769 |
| 277 | Detail 2D Projection DataSet | Detail 2D Projection DataSet is a database of 2D projections of mechanical details with holes. The dataset consists of 13 shape categories where each category i... | shape, holes, detail, binary, matching, retrieval | link | 2015-05-10 | 536 |
| 225 | California-ND | An Annotated Dataset For Near-Duplicate Detection In Personal Photo Collections. Managing photo collections involves a variety of image quality assessment tas... | retrieval duplicate copyright groundtruth detection | link | 2014-03-19 | 704 |
| 200 | Landmark 3D | This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope it coul... | landmark recognition classification retrieval 3d reconstruction codebook matching feature flickr | link | 2016-08-09 | 1032 |
| 199 | THUR15000 | We introduce a labeled dataset of categorized images for evaluating sketch-based image retrieval. Using Flickr, we downloaded about 3000 images for each of the ... | group saliency object detection visual attention sketch shape retrieval internet | link | 2013-10-08 | 915 |
| 159 | Caltech Game Covers Dataset | The Caltech Game Covers dataset consists of CD/DVD covers of video games. The set was downloaded from freecovers.net during the summer of 2008. The set includes... | classification retrieval game cover caltech hierarchy taxonomy | link | 2014-02-20 | 723 |
| 158 | Caltech Buildings Dataset | The Caltech Buildings dataset consists of images taken for 50 buildings around the Caltech campus. Five different images were taken for each building from diffe... | building urban retrieval hierarchy taxonomy caltech | link | 2013-08-08 | 755 |
| 147 | FlickrLogos-32 | The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods... | flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox | link | 2017-05-27 | 1145 |
| 116 | Sheffield Building | Sheffield Building Image Dataset consists of over 3,000 low-resolution images of forty different buildings, typically between 70 and 120 images per building. T... | retrieval, classification, urban, sheffield | link | 2013-03-12 | 750 |
| 115 | Pankrac Marseille | Our repetitive pattern dataset with 106 images of approx. 30 buildings from Pankrac, Prague and Marseille appearing in more than one image, number of appearances r... | classification, retrieval, symmetry, repetition, urban | link | 2013-03-13 | 680 |
| 110 | EITZ Sketch Quality | Humans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily available ... | shape, matching, retrieval, partial, sketch | link | 2014-02-11 | 709 |
| 109 | EITZ Sketch-Based Image Retrieval | We introduce a benchmark for evaluating the performance of large-scale sketch-based image retrieval systems. The necessary data is acquired in a controlled user... | shape, matching, retrieval, partial, sketch | link | 2014-02-11 | 695 |
| 108 | ICG Sketch Retrieval | The ICG Sketch Retrieval dataset consists of XXX hand-drawn sketches for five categories. It is used for content-based image retrieval using shape features for ... | shape, matching, retrieval, partial, sketch | n/a | 2014-02-11 | 736 |
| 103 | COIL-100 | The COIL-100 (Columbia University Image Library) consists of 100 objects. For formal documentation look at the corresponding compressed technical report, [gzipp... | classification, retrieval | link | 2013-03-12 | 736 |
| 102 | Tiny Images | The Tiny Images dataset consists of 79,302,017 images, each being a 32x32 color image. This data is stored in the form of large binary files which can be accese... | classification, tiny, color, retrieval | link | 2013-03-12 | 727 |
| 85 | Leaves | The Leaves dataset from X contains X images of leaves. Leaves dataset taken by Markus Weber. California Institute of Technology PhD student under Pietro Per... | shape, binary, matching, retrieval, partial | n/a | 2015-12-25 | 860 |
| 84 | Aachen Retrieval | The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their SIFT ... | retrieval, aachen, landmark, sfm, reconstruction | link | 2013-03-11 | 909 |
| 63 | Paris500k | The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box rather ... | retrieval, paris, landmark, geotag, flickr, panoramio, sfm, reconstruction | link | 2016-12-23 | 1117 |
| 47 | CMP Retrieval | CMP Dataset by Ondra Chum contains 5 million images collected from the internet.... | retrieval, urban, large scale | link | 2013-03-11 | 707 |
| 46 | Paris Retrieval | The Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/paris_1.... | retrieval, urban, paris, landmark | link | 2016-10-11 | 785 |
| 45 | Oxford Buildings | The Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford landmarks. T... | retrieval, urban, oxford, landmark | link | 2017-04-17 | 891 |
| 44 | UK Bench | The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images in N=2550 groups of four images each, at size 640x480. The images are rotated... | retrieval image object centered rotation | link | 2017-08-31 | 1910 |
| 43 | ZuBud | The Zurich Building dataset (ZuBud) from Hao Shao, Tomas Svoboda and Luc Van Gool [?] contains 1005 images of 201 buildings, each in five views. There is also ... | retrieval, urban, procedural, rectification | link | 2013-03-11 | 802 |
| 21 | ImageNET | The ImageNET dataset is the latest dataset by Li Fei-Fei containing various datasets ranging from 1000 to 10000 categories.... | retrieval, segmentation, classification | link | 2013-03-11 | 931 |
| 8 | Tools2D | The Tools 2D dataset from Bronstein, Bronstein, Bruckstein, and Kimmel [?] is intended for partial similarity experiments and consists of 15 shapes: 5 humans, 5 horses and ... | shape, binary, matching, retrieval, partial | link | 2014-02-11 | 1051 |
| 7 | Mythological Creatures | The Mythological Creatures dataset consists of articulated shapes (silhouettes) for partial similarity experiments and contains 15 shapes: 5 humans, 5 horses and 5 cent... | shape, binary, matching, retrieval, partial, animal | link | 2015-06-23 | 1125 |
| 6 | SIID | The SIID silhouette dataset contains... and is from the Shape Indexing of Image Database (SIID). Download SIID silhouette dataset http://www.lems.brown.edu/... | shape, binary, matching, retrieval | link | 2017-03-02 | 1183 |
| 5 | KIMIA216 | The Kimia 216 has 18 classes, each consisting of 12 images. It contains shape silhouettes for birds, bones, brick, camels, car, children, classic cards, elephan... | shape, binary, matching, retrieval, kimia, animal | link | 2017-09-27 | 1450 |
| 4 | KIMIA99 | The Kimia 99 has 9 classes, each consisting of 11 images. They are part of the Shape Indexing of Image Database (SIID) project, which also contains the SIID... | shape, binary, matching, retrieval, kimia | link | 2015-07-29 | 1115 |
| 3 | KIMIA25 | The Kimia 25 consists of 6 classes and 25 images. They are part of the Shape Indexing of Image Database (SIID) project, which also contains the SIID silhouette ... | shape binary matching retrieval kimia | link | 2015-08-26 | 920 |
| 2 | MPEG-7 Core Experiment CE-Shape-1 | MPEG-7 Core Experiment CE-Shape-1 [?] is a popular database for shape matching evaluation consisting of 70 shape categories, where each category is represented ... | shape, binary, matching, retrieval, bullseye | link | 2017-03-02 | 1810 |


total views: 31289