Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   accuracy   action   activity   address   adhead   adjustment   adult   aerial   aesthetics   affordance   age   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulation   artificial   aspect   atmospheric   attention   attribute   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   bark   baseline   behavior   belgium   benchmark   benchmarking   berlin   bike   bilateral   bim   binary   biology   biometric   biometry   blender   blur   boat   body   bone   bottle   boundingbox   brain   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   california   caltech   camera   canada   caption   captioning   capture   car   cardinal   categorization   category   cats   cbir   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chicaco   chromaticity   church   circle   city   cityscapes   classification   clinical   clothing   cloud   clustering   clutter   cnn   co-localization   co-saliency   co-segmentation   co-skeletonization   coco   code   codebook   coffee   collaborative   color   community   comparison   computer   condition   constancy   context   contour   cooking   copyright   counting   cover   cow   crepe   crf   crop   cross-view   crowd   ct   cutting   daily   dance   dark   data   dataset   day   daylight   decomposition   deep   defocus   deformation   denoising   dense   depth   description   descriptor   detail   detection   dichromatic   disease   disgust   disparity   dna   dogs   domain   dped   driving   drone   dubrovnik   duplicate   dynamic   ear   edge   egocentric   ellipse   emotion   empty   endtoend   enhancement   environment   estimation   evaluation   event   expertise   expression   eye   facade   face   facial   fake   family   fashion   fear   feature   field   fine-grained   fingerprint   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   fog   food   foot   footprint   foreground   forensics   fov   frames   frontview   fundus   gait   game   gan   gaze   gender   genetic   genome   geography   geometry   geoscience   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   grayscale   graz   ground   groundtruth   group   growth   gsd   hand   handwritten   hd   head   heart   heat   hierarchy   high-definition   high-resolution   highlight   highway   holes   horse   hospital   house   howto   human   identification   illumination   illuminiation   illusion   image   imagenet   images   imdb   imu   indoor   inertial   initialization   inserts   instance   intake   intensity   interaction   interactive   interest   internet   invariance   ir   isar   iso   joy   kaggle   kernels   keyframe   kimia   kinect   kinship   kitchen   kitti   label   labeling   laboratory   land   landmark   lane   language   large   large-scale   laser   lattice   layout   leaf   learning   letter   leuven   lidar   lifespan   light   lightfield   lighting   limited   line   lip   lisbon   liver   local   localization   location   logo   low   lowlevel   machine   makeup   manhattan   map   maritime   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   monitoring   mono   montage   motion   motorbike   mouse   mouth   movement   movie   mpeg   mser   mug   multi-camera   multi-class   multi-human   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multimedia   multimodal   multiple   multispectral   multitarget   multiview   naming   natural   nature   navigation   netherlands   network   neutral   newyork   night   nir   noise   normal   nude   number   object   occlusion   ocr   odometry   omnidirection   omnidirectional   online   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   pan   panchromatic   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   pedestrians   people   person   perspective   phase   photo   photogrammetry   physics   pittsburgh   place   plane   planning   plant   point   pointcloud   polygon   popularity   pornography   pose   potsdam   presentation   pressure   primitive   privacy   procedural   profile   project   proposal   pruning   ptz   quality   question   radar   random   rank   ranking   ransac   rate   ratio   re-identification   reading   real   real-world   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   relationship   remote   removal   rendering   repetition   resolution   restoration   retina   retinal   retrieval   rgb   rgbd   road   robot   robotic   robust   rome   room   ros   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   search   segmentation   selfdriving   semantic   sense   sensing   sequence   series   sfm   shadow   shape   sheffield   shoe   shots   shutter   sideview   sign   signs   similarity   simultaneous   single   singleview   size   skeleton   skeletonization   sketch   skin   sky   slam   smartphone   soccer   social   software   source   space   spain   spanish   speaker   speech   speed   sphere   sport   stability   stabilization   static   stationary   steganalysis   steganography   stereo   stereovision   stochastic   street   structure   structured   study   stuff   style   stylization   subpixel   subtraction   summarization   summary   superpixel   superresolution   supervised   supervisely   surface   surgery   surprise   surveillance   swan   switzerland   sydney   symmetry   synthetic   table   target   task   taxonomy   temporal   text   textile   texture   texture-less   therapy   thermal   things   time   timelapse   tiny   tokyo   tool   tools   top-view   topcoder   tracking   tracklet   traffic   trajectory   transfer   transportation   trees   triangulation   truth   tuberculosis   turbulence   type   uas   uav   udacity   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vegetation   vehicle   vehicles   vessel   video   view   viewpoint   virtual   visible   vision   visual   voc   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wiki   wikipedia   wild   workflow   world   worldwide   xray   year   youtube   zoom   zurich  
«showing 665 tags of 665 total tags for 470 datasets (1.41) »


image
DID Name Description Tags URL Date Views
469 ALASKA ALASKA is the second contest on steganalysis ; after a fruitful first contest, called BOSS and organized in 2010, which give birth to the development of large f... steganalysis steganography image recognition challenge forensics link 2018-09-07 5
460 Exclusively-Dark-Image-Dataset In order to facilitate a new object detection and image enhancement research, we introduce the Exclusively Dark (ExDark) dataset (CVIU - under review). The Excl... object detection, low light dark images, image enhancement link 2018-06-26 124
453 San Diego State University - Open Turbulent Image Set (OTIS) Despite the existence of several turbulence mitigation algorithms in the literature, no common dataset exists to objectively evaluate their efficiency. This dat... Image Sequence Atmospheric Turbulence Restoration Evaluation link 2018-04-16 71
432 Collaborative 3D reconstruction with smartphones collaborative 3d reconstruction with smartphones dataset: Six off-the-shelf Android smartphones captured video streams (Table 1, see below) of three cultural h... collaborative 3d reconstruction smartphone image cloud video link 2018-03-15 103
427 CITY-OSM - ETH Zurich # Learning Aerial Image Segmentation From Online Maps This is the ground truth data generated for the publication Learning Aerial Image Segmentation F... semantic computer vision aerial image segmentation map geoscience remote sensing deep learning berlin chicaco paris potsdam tokyo zurich link 2018-01-25 257
424 Automatic Image Cropping The Automatic Image Cropping dataset contains ill-composed images with manual crops provided by qualified experts. As described in Section 2.1, our visual co... image crop automatic aesthetics multimedia link 2018-01-10 199
413 DPED: DSLR Photo Enhancement Dataset We introduce a large-scale DPED dataset that consists of photos taken synchronously in the wild by three smartphones and one DSLR camera. The devices used to co... dped image photo enhancement deep learning computer vision link 2017-10-24 219
395 AWS Public Datasets AWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have required hour... amazon aerial classification deep learning segmentation recognition satellite human biology space image resolution link 2018-04-30 555
393 ZuBuD+ ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previous datase... landmark, building, image retrieval, urban link 2017-07-17 304
388 Open Images Dataset v4 new Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ... classification large-scale category real image deep annotation automatic benchmark boundingbox link 2018-09-11 512
380 CERTH Image Blur Dataset The CERTH image blur dataset consists of 2450 digital images, 1850 out of which are photographs captured by various camera models in different shooting conditio... blur motion defocus detection quality image link 2018-07-23 515
354 Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara... Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval link 2017-02-27 901
343 FIRE Fundus Image Registration Dataset A benchmark dataset for the evaluation of retinal image registration methods is introduced. The dataset consists on 134 image pairs and is annotated with ground... retina retinal image registration fundus eye link 2016-10-17 607
335 General 100 General-100 dataset contains 100 bmp-format images (with no compression). We used this dataset in our FSRCNN ECCV 2016 paper. The size of these 100 images range... image superresolution link 2017-07-22 954
325 Synthesized Inverse Synthetic Aperture Radar (ISAR) Images of Aircrafts The database contains synthesized inverse synthetic aperture radar images of seven aircraft models. Reference: Hari Kishan Kondaveeti, Valli Kumari Va... ISAR, image, classification link 2018-08-29 857
313 Automotive Multi-sensor (AMUSE) The automotive multi-sensor (AMUSE) dataset consists of inertial and other complementary sensor data combined with monocular, omnidirectional, high frame rate v... street urban inertial video image traffic city api link 2017-11-28 1091
309 Coutour patches The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im... patch image match contour edge lowlevel detection segmentation link 2015-09-29 744
280 Yahoo Flickr Creative Commons 100M Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ... flickr landmark image recognition detection reconstruction 3d clustering social community internet link 2015-09-24 1229
236 iCoseg dataset iCoseg dataset introduces the largest publicly available co-segmentation dataset of 38 groups (643 images), along with pixel ground-truth hand annotations.... image co-segmentation link 2017-06-22 1305
209 Symmetry Set The Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image Matching... symmetry matching feature image illumination lighting urban building link 2017-05-03 1110
147 FlickrLogos-32 The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods... flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox link 2018-03-08 1554
44 UK Bench The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images of N=2550 groups with each four images at size 640x480. The images are rotated... retrieval image object centered rotation link 2018-07-11 2645
20 CALTECH 256 The CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories.... classification centered object scene image link 2013-08-08 1035
19 CALTECH 101 The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images at rough... classification centered object scene image link 2013-08-08 1030


total views: 17926 5 queries in 0.00012588500976562s 0.00011897087097168s 0.00018787384033203s 0.00010490417480469s 0.001417875289917s and total 0.0084578990936279s