This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each entry also has a description covering common problems, pitfalls, and characteristics, and there is now a searchable tag cloud.
Plus, the list is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
Showing 697 of 697 tags for 514 datasets (1.36).
|483||CUHK DeepFashion2||DeepFashion2 is a comprehensive fashion dataset. It contains 491K diverse images of 13 popular clothing categories from both commercial shopping stores and cons...||fashion apparel attributes recognition localization human benchmark polygon annotation instance semantic segmentation||link||2019-08-18||1109|
|482||CUHK DeepFashion||We contribute the DeepFashion database, a large-scale clothes database, which has several appealing properties: First, DeepFashion contains over 800,000 diverse ...||fashion apparel attributes recognition localization human benchmark polygon annotation instance semantic segmentation||link||2019-05-24||864|
|481||ModaNet and PaperDoll||ModaNet is a street fashion image dataset consisting of annotations related to RGB images. ModaNet provides multiple polygon annotations for each image. This d...||fashion apparel attributes recognition localization human benchmark polygon annotation instance semantic segmentation||link||2019-01-09||716|
|471||CrowdFlower||Note: commercial annotation platform, not a publicly released dataset. Our Human-in-the-Loop Machine Learning platform transforms unstructured text, image, audio, ...||dataset benchmark annotation||link||2018-09-11||382|
|470||MVOR||MVOR is a Multi-view Multi-person RGB-D Operating Room Dataset for 2D and 3D Human Pose Estimation. We are pleased to announce the release of the MVOR datase...||medical clinical human annotation multiview pose estimation rgbd operation hospital||link||2018-10-08||493|
|444||Supervisely Person Dataset||The Supervisely Person Dataset consists of 5711 images with 6884 high-quality annotated person instances. All steps below are done inside Supervisely without a...||person pedestrian segmentation semantic mask supervisely annotation automatic dataset instance||link||2020-06-01||3570|
|418||Udacity Annotated Driving Datasets||The Udacity Annotated Driving Datasets comprise two datasets. Dataset 1: The dataset includes driving in Mountain View, California, and neighboring cities during dayli...||classification segmentation urban street selfdriving autonomous udacity annotation california city daylight||link||2020-05-07||1326|
|404||Zurich Summer Dataset||The Zurich Summer v1.0 dataset is a collection of 20 chips (crops), taken from a QuickBird acquisition of the city of Zurich (Switzerland) in August 2002. Quick...||satellite segmentation semantic aerial urban city zurich pan nir rgb gsd superpixel annotation||link||2017-09-12||1083|
|398||Osnabrück - Gaze Tracking Data Set||Gaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with polygo...||segmentation, gaze data, polygon annotation, video, metadata||link||2018-02-13||752|
|396||ADE20K||Scene Parsing Benchmark. Scene parsing data and part segmentation data derived from the ADE20K dataset can be downloaded from the MIT Scene Parsing Benchmark. Images ...||segmentation semantic annotation benchmark scene recognition||link||2017-08-03||831|
|388||Open Images Dataset v4||Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ...||classification large-scale category real image deep annotation automatic benchmark boundingbox||link||2018-09-11||826|
|372||VOT2016 segmentation||The VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from the VOT2016 dataset. The annotation is in the form of BW image...||object tracking segmentation mask annotation visual||link||2017-04-17||701|
|354||Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle||FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara...||Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval||link||2019-12-01||1550|
|353||COCO-Stuff||COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks like sema...||semantic segmentation stuff things COCO caption annotation groundtruth benchmark||link||2019-01-09||1414|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2020-02-18||4513|