This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 641 tags of 641 total tags for 459 datasets (1.4) »
|417||Visual Lip Reading Feasibility (VRLF)||The VLRF database is designed with the aim to contribute to research in visual only speech recognition. A key difference of the VLRF database with respect to ex...||lip reading recognition speaker spanish language mouth face speech||link||2017-11-07||175|
|414||FashionGAN Dataset||New annotations (languages and segmentation maps) on the subset of the DeepFashion dataset. The data is used in the paper Be Your Own Prada: Fashion Synthes...||GAN, Fashion, Segmentation, Language||link||2017-10-30||316|
|400||Visual Discriminative Question Generation (VDQG) dataset||The dataset contains 11202 ambiguous image pairs collected from Visual Genome. Each image pair is annotated with 4.6 discriminative questions and 5.9 non-discri...||vision language VQA question genome biology||link||2017-09-12||371|
|154||WordNet||WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a di...||language, hierarchy, imagenet, classification||link||2013-08-07||824|