Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each entry also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at

Content, design and idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?


2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   actions   activities   activity   address   adhead   adjustment   aerial   aesthetics   age   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulation   aspect   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   blur   boat   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   california   caltech   camera   canada   captioning   captions   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chromaticity   church   circle   cities   city   classification   clothing   clustering   clutter   cnn   co-segmentation   co-skeletonization   coco   code   codebook   coffee   color   community   comparison   computer   conditions   constancy   context   contour   cooking   copyright   cosegmentation   counting   cover   cow   crepe   cross-view   crowd   ct   cutting   daily   dance   data   dataset   day   daylight   decomposition   deep   defocus   deformation   dense   depth   description   descriptor   detail   detection   dichromatic   disgust   disparity   dogs   domain   dped   driving   drone   dubrovnik   duplicate   dynamic   ear   edge   egocentric   ellipse   emotion   endtoend   enhancement   estimation   evaluation   event   expression   eye   facade   face   facial   fashion   fear   feature   field   fine-grained   fingerprint   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   footprint   foreground   fov   frames   
frontview   fundus   gait   game   gan   gaze   gender   genetic   genome   geography   geometry   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   groundtruth   group   gsd   hand   handwritten   hd   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   house   human   humans   identification   illumination   image   imagenet   images   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   kernels   keyframe   kimia   kinect   label   labeling   laboratory   land   landmark   lane   language   large   large-scale   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lip   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   maritime   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motion-capture-data   motorbike   mouse   mouth   movement   movie   mpeg   mug   multi-camera   multi-class   multi-human   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multimodal   multiple   multitarget   multiview   naming   natural   nature   navigation   network   neutral   newyork   night   nir   noise   normal   nude   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   pan   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   people   person   perspective   phase   photo   photogrammetry   physics   pittsburgh   place   plane   planning  
 point   pointcloud   polygon   popularity   pornography   pose   presentation   pressure   primitive   privacy   procedural   profile   proposal   ptz   quality   question   radar   random   rank   ranking   ransac   rate   ratio   re-identification   reading   real   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   reidentification   remote   removal   rendering   repetition   resolution   retina   retinal   retrieval   rgb   rgb-d   rgbd   road   robot   robust   rome   room   ros   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   scenes   search   segmentation   selfdriving   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   sheffield   shoes   shots   shutter   sideview   sign   similarity   simultaneous   single   singleview   skeleton   skeletonization   sketch   skin   sky   slam   soccer   social   software   source   space   spain   spanish   speaker   speech   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   structure   structured   study   stuff   stylization   subpixel   subtraction   summarization   summary   superpixel   superresolution   supervised   surface   surgery   surprise   surveillance   swan   switzerland   sydney   symmetry   synthetic   table   target   taxonomy   temporal   text   texture   texture-less   therapy   thermal   things   time   time-series   tiny   tool   tools   top-view   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   uav   udacity   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vessel   video   view   viewpoint   visible   vision   visual   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   workflow   
world   xray   year   zoom   zurich  
«showing 591 tags of 591 total tags for 421 datasets (1.4) »


face
DID | Name | Description | Tags | URL | Date | Views
417 | Visual Lip Reading Feasibility (VLRF) | The VLRF database is designed with the aim to contribute to research in visual-only speech recognition. A key difference of the VLRF database with respect to ex... | lip reading recognition speaker spanish language mouth face speech | link | 2017-11-07 | 31
412 | MegaAge Dataset | We introduce a new large-scale MegaAge dataset that consists of 41,941 faces annotated with age posterior distributions. We also provide the MegaAge-Asian datas... | Face Analysis, Age Estimation | link | 2017-10-12 | 78
402 | GeoFaces | A large dataset of geotagged face images collected from Flickr. The zip file contains text files with the URLs of the images. Face2GPS: Estimating Geograph... | face localization geotagged classification gender age human | link | 2017-09-06 | 109
364 | ETH CVL IMDB WIKI Faces | Since the publicly available face image datasets are often of small to medium size, rarely exceeding tens of thousands of images, and often without age informat... | face imdb wikipedia detection recognition age biometry | link | 2017-02-22 | 334
355 | IMPART multi-modal/multi-view | The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The sourc... | multi-view multi-mode video rgbd lidar 3d model color indoor outdoor dynamic action face human emotion | link | 2017-01-01 | 368
354 | Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle | FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara... | Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval | link | 2017-02-27 | 519
345 | MMSE Heartrate | The MMSE heart rate dataset measures the visual heart rate from faces by throwing darts at people. ... | face landmark emotion heart rate biology | n/a | 2016-10-21 | 491
340 | Ljubljana CVL Face Database | The database contains 798 images of 114 persons, with 7 images per person, and is freely available for research purposes. All images were taken in supervised conditi... | face pedestrian person recognition biometry human illumination lighting | link | 2017-02-22 | 461
329 | Virginia Tech and Arab Academy for Science & Technology (VT-AAST) | The VT-AAST Benchmarking Dataset: A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques. A new color face image database for ... | face, detection, skin, segmentation, benchmarking | link | 2016-07-11 | 569
314 | WIDER FACE: A Face Detection Benchmark | The WIDER FACE dataset is a large-scale face detection benchmark with 32,203 images and 393,703 face annotations, which have a high degree of variability in... | face detection scale pose occlusion | link | 2016-02-11 | 1010
310 | FASSEG - FAce Semantic Segmentation | The FAce Semantic SEGmentation (FASSEG) repository contains datasets for multi-class semantic face segmentation. The FASSEG repository is composed of two dat... | face, segmentation | link | 2017-04-04 | 1013
290 | UWO GCO Volume Segmentation | The Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc consistenc... | medical liver babyface bone abdomen adhead face segmentation binary optimization | link | 2015-06-19 | 537
261 | MPI Multi-View Collection GVV datasets | Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of the G... | video multiview tracking face mesh reconstruction depth human action pose | link | 2014-12-10 | 760
257 | FaceScrub | The FaceScrub dataset comprises a total of 107,818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per person. M... | face detection recognition celebrity people human | link | 2017-11-12 | 938
256 | Multi-Task Facial Landmark (MTFL) dataset | This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ... | face, landmark detection, deep learning, cnn, attribute | link | 2015-11-07 | 1757
254 | ChokePoint Dataset | We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e... | human pedestrian identification recognition multiview sequence face detection real world surveillance clustering | link | 2015-05-02 | 1208
220 | 3D Mask Attack Dataset | The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re... | 3d biometry face recognition segmentation frontview emotion | link | 2016-03-14 | 1087
211 | POSTECH Labeled Faces in the Wild | POS Labeled Faces in the Wild, a collection of faces proposed for studying face identification in unconstrained environments; its purpose is serving as a... | face identification wild recognition registration | link | 2015-09-10 | 1165
192 | Our Database of Faces | The Our Database of Faces (ORL) dataset contains ten different images of each of 40 distinct subjects. For some subjects, the images were taken at different tim... | face recognition illumination human expression | link | 2013-09-23 | 981
161 | ICG Annotated Facial Landmarks in the Wild (AFLW) | The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet... | face detection landmark pose age annotation | link | 2017-07-25 | 2211
51 | PN Learning | PN Learning - How does TLD work? Tracking estimates the object location as long as the object is visible. During tracking all observed patterns of the object... | single target tracking learning object pedestrian bike face | link | 2017-11-28 | 741
50 | Babenko tracking | The Babenko tracking dataset contains 12 video sequences for single object tracking. For each clip they provide (1) a directory with the original image s... | tracking single object animal face occlusion video | link | 2016-08-08 | 2390
29 | The Yale Face | The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to... | face, pedestrian, detection, pose, illumination | link | 2015-06-23 | 841
28 | CMU Faces - Frontal faces | The MIT + CMU frontal face dataset from H. Rowley contains 130 images with 507 labeled frontal faces from movie, portrait and media sources. It is mostly graysc... | frontview, face, detection object boundingbox | link | 2015-06-19 | 874
27 | Idiap/ETHZ Faces and Poses | Idiap/ETHZ Faces and Poses dataset by L. Jie, B. Caputo and V. Ferrari contains 1703 image-caption pairs. [author] Captions contain the names of some of... | face, pose, pedestrian, text | link | 2013-03-11 | 835


total views: 21308