This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 641 tags of 641 total tags for 454 datasets (1.41) »
|453||San Diego State University - Open Turbulent Image Set (OTIS)||Despite the existence of several turbulence mitigation algorithms in the literature, no common dataset exists to objectively evaluate their efficiency. This dat...||Image Sequence Atmospheric Turbulence Restoration Evaluation||link||2018-04-16||11|
|432||Collaborative 3D reconstruction with smartphones||collaborative 3d reconstruction with smartphones dataset: Six off-the-shelf Android smartphones captured video streams (Table 1, see below) of three cultural h...||collaborative 3d reconstruction smartphone image cloud video||link||2018-03-15||33|
|427||CITY-OSM - ETH Zurich||# Learning Aerial Image Segmentation From Online Maps This is the ground truth data generated for the publication Learning Aerial Image Segmentation F...||semantic computer vision aerial image segmentation map geoscience remote sensing deep learning berlin chicaco paris potsdam tokyo zurich||link||2018-01-25||152|
|424||Automatic Image Cropping||The Automatic Image Cropping dataset contains ill-composed images with manual crops provided by qualified experts. As described in Section 2.1, our visual co...||image crop automatic aesthetics multimedia||link||2018-01-10||123|
|413||DPED: DSLR Photo Enhancement Dataset||We introduce a large-scale DPED dataset that consists of photos taken synchronously in the wild by three smartphones and one DSLR camera. The devices used to co...||dped image photo enhancement deep learning computer vision||link||2017-10-24||160|
|395||AWS Public Datasets||AWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have required hour...||amazon classification deep learning segmentation recognition satellite human biology space image resolution||link||2017-07-28||380|
|393||ZuBuD+||ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previous datase...||landmark, building, image retrieval, urban||link||2017-07-17||243|
|388||Open Images Dataset||Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ...||classification large-scale category real image deep annotation automatic||link||2017-07-02||399|
|380||CERTH Image Blur Dataset||The CERTH image blur dataset consists of 2450 digital images, 1850 out of which are photographs captured by various camera models in different shooting conditio...||blur motion defocus detection quality image||link||2017-05-24||354|
|354||Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle||FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara...||Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval||link||2017-02-27||755|
|343||FIRE Fundus Image Registration Dataset||A benchmark dataset for the evaluation of retinal image registration methods is introduced. The dataset consists on 134 image pairs and is annotated with ground...||retina retinal image registration fundus eye||link||2016-10-17||516|
|335||General 100||General-100 dataset contains 100 bmp-format images (with no compression). We used this dataset in our FSRCNN ECCV 2016 paper. The size of these 100 images range...||image superresolution||link||2017-07-22||724|
|325||Synthesized Inverse Synthetic Aperture Radar (ISAR) Images of Aircrafts||The database contains synthesized inverse synthetic aperture radar images of seven aircraft models. Reference: Hari Kishan Kondaveeti, Valli Kumari Va...||ISAR, image, classification||link||2016-03-17||762|
|313||Automotive Multi-sensor (AMUSE)||The automotive multi-sensor (AMUSE) dataset consists of inertial and other complementary sensor data combined with monocular, omnidirectional, high frame rate v...||street urban inertial video image traffic city api||link||2017-11-28||982|
|309||Coutour patches||The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im...||patch image match contour edge lowlevel detection segmentation||link||2015-09-29||666|
|280||Yahoo Flickr Creative Commons 100M||Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ...||flickr landmark image recognition detection reconstruction 3d clustering social community internet||link||2015-09-24||1104|
|236||iCoseg dataset||iCoseg dataset introduces the largest publicly available co-segmentation dataset of 38 groups (643 images), along with pixel ground-truth hand annotations....||image co-segmentation||link||2017-06-22||1196|
|209||Symmetry Set||The Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image Matching...||symmetry matching feature image illumination lighting urban building||link||2017-05-03||1035|
|147||FlickrLogos-32||The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods...||flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox||link||2018-03-08||1391|
|44||UK Bench||The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images of N=2550 groups with each four images at size 640x480. The images are rotated...||retrieval image object centered rotation||link||2017-12-20||2368|
|20||CALTECH 256||The CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories....||classification centered object scene image||link||2013-08-08||962|
|19||CALTECH 101||The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images at rough...||classification centered object scene image||link||2013-08-08||964|