This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 676 tags of 676 total tags for 473 datasets (1.43) »
|463||Families In The Wild (FIW) Database||Families In The Wild (FIW) Database is the largest and most comprehensive database available for kinship recognition. FIW is made up of 11,932 natural family p...||recognition kinship family relationship dna similarity face||link||2018-08-08||58|
|442||YouTube Co-localization Dataset (ECCV + IEEE Trans. CSVT papers) [GEU and NTU]||The dataset consists of bounding box annotations for 15k frames of videos collected from YouTube Objects Dataset. If you find this dataset useful, kindly ci...||Co-localization Co-segmentation Co-saliency Video CATS Tracklet Benchmark Binary Object Retrieval Segmentation Semantic Similarity Tracking Matching Localization||link||2018-03-21||222|
|181||All I Have Seen (AIHS)||The All I Have Seen (AIHS) dataset is created to study the properties of total visual input in humans, for around two weeks Nebojsa Jojic wore a camera capturin...||video summary user study clustering similarity outdoor indoor scene 3d||link||2018-09-19||1022|
|179||CMP Facades||The CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from various sou...||facade rectification urban semantic classification recognition structure similarity segmentation||link||2015-06-19||981|
|178||VSUMM (Video SUMMarization)||The VSUMM (Video SUMMarization) dataset is of 50 videos from Open Video. All videos are in MPEG-1 format (30 fps, 352 x 240 pixels), in color and with sound. Th...||video summary type user study keyframe static similarity||link||2015-11-13||1343|