This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also has a description covering common problems, pitfalls, and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
Showing 697 of 697 tags for 511 datasets (1.36).
|475||MAE Dataset||The Multimodal Attribute Extraction (MAE) dataset is the first benchmark dataset for the task of multimodal attribute extraction. It is composed of mixed media ...||multimedia multimodal images text attribute recognition pair product search asset retrieval||link||2018-11-20||237|
|438||CAD 120 affordance||This is the CAD 120 Affordance Segmentation Dataset based on the Cornell Activity Dataset CAD 120 (see http://pr.cs.cornell.edu/humanactivities/data.php). Co...||segmentation affordance action cad attribute human||link||2018-03-15||398|
|337||WIDER Attribute Dataset||The WIDER Attribute dataset is a human attribute recognition benchmark dataset whose images are selected from the publicly available WIDER dataset. There are a ...||Attribute recognition, Human attribute||link||2016-09-22||2064|
|328||UT Zappos50K||UT Zappos50K (UT-Zap50K) is a large shoe dataset consisting of 50,025 catalog images collected from Zappos.com. The images are divided into 4 major categories -...||fine-grained, ranking, local learning, pairwise comparison, shoe, attribute||link||2018-09-11||1194|
|278||Comprehensive Cars (CompCars)||The Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature data contai...||car vehicle recognition attribute classification fine-grained urban object||link||2019-09-28||2856|
|258||Visual Attributes dataset||The Visual Attributes dataset contains visual attribute annotations for over 500 object classes (animate and inanimate) which are all represented in ImageNet. E...||classification recognition attribute imagenet object||link||2019-10-28||1438|
|256||Multi-Task Facial Landmark (MTFL) dataset||This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ...||face, landmark detection, deep learning, cnn, attribute||link||2015-11-07||2868|