This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 697 tags of 697 total tags for 516 datasets (1.35) »
|417||Visual Lip Reading Feasibility (VRLF)||The VLRF database is designed with the aim to contribute to research in visual only speech recognition. A key difference of the VLRF database with respect to ex...||lip reading recognition speaker spanish language mouth face speech||link||2017-11-07||672|
|414||FashionGAN Dataset||New annotations (languages and segmentation maps) on the subset of the DeepFashion dataset. The data is used in the paper Be Your Own Prada: Fashion Synthes...||GAN, Fashion, Segmentation, Language||link||2020-01-05||1014|
|400||Visual Discriminative Question Generation (VDQG) dataset||The dataset contains 11202 ambiguous image pairs collected from Visual Genome. Each image pair is annotated with 4.6 discriminative questions and 5.9 non-discri...||vision language VQA question genome biology||link||2017-09-12||918|
|154||WordNet||WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a di...||language, hierarchy, imagenet, classification||link||2013-08-07||1101|