This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 697 tags of 697 total tags for 511 datasets (1.36) »
|489||CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis||Car Accident Detection and Prediction~(CADP) dataset consists of 1,416 video segments collected from YouTube, with 205 video segments have full spatio-temporal ...||Car Accident Detection, Accident Forecasting, CCTV analysis, Camera based accident analysis||link||2019-02-26||962|
|472||human3.6m||human3.6m dataset is one of the largest datasets for 3D human pose estimation. It consists of 3.6 million images featuring 11 actors performing 15 daily activ...||human pose estimation camera video 3d laser scan action actor body part mocap||link||2019-11-18||1164|
|462||Taskonomy||The Taskonomy dataset consists of 3.9 Mil. Scenes, 600 Buildings, 25 Tags per Image, 1024 Resolution for taxonomy and transfer learning tasks. We provide a larg...||transfer learning taxonomy task deep indoor 3d mesh pose camera high-resolution||link||2018-08-08||362|
|459||MVSEC||The Multi Vehicle Stereo Event Camera dataset is a collection of data designed for the development of novel 3D perception algorithms for event based cameras. St...||event camera speed intensity dynamic gps imu 3d benchmark||link||2018-05-30||352|
|443||ApolloScape Semantic Segmentation||The ApolloScape Parsing dataset is provided by Baidu for the CVPR 2018 Workshop on Autonomous Driving Challenge. It is expected that the Scene Parsing dataset ...||segmentation semantic scene benchmark size urban autonomous driving camera calibration video||link||2019-03-08||955|
|346||LASIESTA (Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms)||LASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving object det...||dataset groundtruth motion object detection foreground background subtraction challenge stationary camera||link||2017-09-12||975|
|332||Multi-FoV - Large Field-of-View Cameras for Visual Odometry||The Multi-FoV synthetic datasets are two synthetic scenes (vehicle moving in a city, and flying robot hovering in a confined room). For each scene, three differ...||visual odometry camera fov synthetic groundtruth blender||link||2016-08-11||1140|
|286||HDA Person Dataset - ISR Lisbon||The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De...||Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human||link||2019-04-23||3700|
|226||Fish4Knowledge||The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and extracted...||classification animal fish video motion nature recognition water camera||link||2014-05-15||1695|
|215||WILD -Weather and Illumination Database||The Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seasons. It...||webcam light illumination camera video static change urban time depth estimation weather newyork||link||2016-04-19||2784|
|214||The Webcam Clip Art Dataset||This is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequences. As...||webcam light illumination camera video static change urban nature time||link||2014-02-01||1266|
|205||GaTech VideoStab||The GaTech VideoStab dataset consists of N videos for the task of video stabilization. This code is implemented in Youtube video editor for stabilization. ...||video stabilization camera path||link||2013-10-09||1375|
|204||UCF Person and Car VideoSeg||The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm...||video segmentation object motion model camera groundtruth||link||2015-04-19||1552|
|203||GaTech VideoSeg||The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat...||video segmentation object motion model camera||link||2013-10-09||1676|
|202||GaTech SegTrack||The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura...||video segmentation object proposal flow optical motion model camera stationary groundtruth||link||2013-10-09||1458|
|195||Yotta||The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driving. ...||semantic segmentation urban video camera 3d reconstruction classification||link||2013-09-30||1364|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2018-06-28||2672|
|185||Kung-Fu fighter Multi-View||The test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for testing...||multiview tracking segmentation camera action||link||2013-10-08||1420|
|180||Airport MotionSeg||The Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is challenging b...||motion segmentation airport video clustering camera zoom||link||2013-09-04||1546|
|166||ICG Multi-Camera Datasets||The ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowded sc...||multiview pedestrian tracking detection object camera calibration graz indoor video multitarget||link||2015-06-19||1940|
|165||ICG Multi-Camera and Virtual PTZ||The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph...||multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget||link||2017-08-19||2051|
|164||ICG Lab 6 (Multi-Camera Multi-Object Tracking)||The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came...||multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz||link||2017-12-05||2843|
|156||KUL Belgium Traffic Signs||BelgiumTS is a large dataset with 10000+ traffic sign annotations, thousands of physically distinct traffic signs. 4 video sequences recorded with 8 high resolu...||traffic sign classification urban road belgium camera calibration||link||2020-01-07||2199|
|105||MSR 3D Video||These sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitnick, ...||reconstruction, camera, segmentation, depth||link||2013-03-12||1435|