This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 697 tags of 697 total tags for 514 datasets (1.36) »
|462||Taskonomy||The Taskonomy dataset consists of 3.9 Mil. Scenes, 600 Buildings, 25 Tags per Image, 1024 Resolution for taxonomy and transfer learning tasks. We provide a larg...||transfer learning taxonomy task deep indoor 3d mesh pose camera high-resolution||link||2018-08-08||431|
|447||WIKI List||A list of machine learning datasets ...||benchmark dataset wiki aerial machine learning||link||2018-04-19||460|
|427||CITY-OSM - ETH Zurich||# Learning Aerial Image Segmentation From Online Maps This is the ground truth data generated for the publication Learning Aerial Image Segmentation F...||semantic computer vision aerial image segmentation map geoscience remote sensing deep learning berlin chicaco paris potsdam tokyo zurich||link||2018-01-25||707|
|413||DPED: DSLR Photo Enhancement Dataset||We introduce a large-scale DPED dataset that consists of photos taken synchronously in the wild by three smartphones and one DSLR camera. The devices used to co...||dped image photo enhancement deep learning computer vision||link||2017-10-24||574|
|401||Berkeley DeepDrive Video||The Berkeley DeepDrive Video Dataset contains 2x order of magnitude more video training data. Explore 100,000 HD video sequences of over 1,100-hour driving...||urban autonomous driving deep learning endtoend||link||2018-06-26||923|
|399||Osnabrück - Synthetic Scalable Cube Dataset||Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and valida...||3D, Deep Learning, Reconstruction, SfM, Synthetic city urban||link||2018-02-13||762|
|395||AWS Public Datasets||AWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have required hour...||amazon aerial classification deep learning segmentation recognition satellite human biology space image resolution||link||2018-10-26||1221|
|354||Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle||FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara...||Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval||link||2019-12-01||1542|
|328||UT Zappos50K||UT Zappos50K (UT-Zap50K) is a large shoe dataset consisting of 50,025 catalog images collected from Zappos.com. The images are divided into 4 major categories -...||fine-grained, ranking, local learning, pairwise comparison, shoe, attribute||link||2018-09-11||1256|
|316||Extreme Classification Repository||The Extreme Classification Repository: Multi-label Datasets & Code Kush Bhatia • Kunal Dahiya • Himanshu Jain • Yashoteja Prabhu • Manik Varma The objecti...||machine learning multilabel classification benchmark evaluation||link||2018-03-19||1559|
|291||MIT Places205||Places205 dataase contains 2.5 million images from 205 scene categories for the academic public. The image dataset contains 2,448,873 images from 205 scene c...||place recognition urban scene feature learning||link||2016-02-24||1484|
|256||Multi-Task Facial Landmark (MTFL) dataset||This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ...||face, landmark detection, deep learning, cnn, attribute||link||2015-11-07||2971|
|184||MSR Learning to Rank||The MSR Learning to Rank are two large scale datasets for research on learning to rank: MSLR-WEB30k with more than 30,000 queries and a random sampling of it MS...||rank learning sampling search||link||2019-08-16||1153|
|147||FlickrLogos-32||The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods...||flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox||link||2019-11-12||2544|
|146||Multiple Instance Learning dataset||MIL data sets used in our 2002 NIPS paper for Elepphant, Musk, TREC http://www.cs.cmu.edu/~juny/MILL/MIL-experiments.htm...||machine learning, classification||link||2013-05-30||1225|
|145||KnapSack||KNAPSACK_01 is a dataset directory which contains some examples of data for 01 Knapsack problems. In the 01 Knapsack problem, we are given a knapsack of fixe...||machine learning, classification||link||2018-06-04||1367|
|104||Make3D Depth||The Make3D Depth dataset s designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data: Make3...||depth, learning, single view, outdoor, indoor||link||2019-04-03||2274|
|51||PN Learning||PN Learning - How does TLD work? Tracking estimates the object location as long as the object is visible. During tracking all observed patterns of the object...||single target tracking learning object pedestrian bike face||link||2017-11-28||1393|
|49||PhotoTourism Pair Patch||The data is taken from Photo Tourism reconstructions from Trevi Fountain (Rome), Notre Dame (Paris) and Half Dome (Yosemite). Each dataset consists of a series ...||feature matching description pair sfm patch learning||link||2018-01-10||1480|