This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 684 tags of 684 total tags for 487 datasets (1.4) »
|468||NewBarkTex||The BarkTex database includes six tree bark classes, with 68 images per class. To build the New BarkTex set, a region of interest, centered on the bark and whos...||Bark, texture, computer vision, classification||link||2018-09-07||78|
|467||BarkTex||This image database contains a collection of 408 color textures for the computer vision community. The pictures show the bark of six different European trees....||Bark, texture, computer vision, classification||link||2018-09-07||89|
|466||Trunk12||Since there is no known publicly available tree bark image data set, a new publicly available data set was created as a part of Bsc thesis. It contains about 36...||Bark, texture, computer vision, classification||link||2018-09-07||77|
|427||CITY-OSM - ETH Zurich||# Learning Aerial Image Segmentation From Online Maps This is the ground truth data generated for the publication Learning Aerial Image Segmentation F...||semantic computer vision aerial image segmentation map geoscience remote sensing deep learning berlin chicaco paris potsdam tokyo zurich||link||2018-01-25||338|
|413||DPED: DSLR Photo Enhancement Dataset||We introduce a large-scale DPED dataset that consists of photos taken synchronously in the wild by three smartphones and one DSLR camera. The devices used to co...||dped image photo enhancement deep learning computer vision||link||2017-10-24||279|
|400||Visual Discriminative Question Generation (VDQG) dataset||The dataset contains 11202 ambiguous image pairs collected from Visual Genome. Each image pair is annotated with 4.6 discriminative questions and 5.9 non-discri...||vision language VQA question genome biology||link||2017-09-12||566|
|351||CMLA Subpixel Stereo Dataset||A 66 stereo pairs dataset with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very precise t...||stereo stereo vision subpixel groundtruth 3D pointcloud noise depth||link||2019-01-08||701|
|323||UT Egocentric (UT Ego) Dataset||The Univ. of Texas at Austin Egocentric (UT Ego) Dataset contains 4 videos captured from head-mounted cameras. Each video is about 3-5 hours long, captured in ...||First-person vision, egocentric||link||2016-03-17||697|
|249||Image Sequence Analysis Test Site (EISATS)||The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as...||stereo vision optical flow motion analysis semantic segmentation||link||2014-09-30||1340|