This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 591 tags of 591 total tags for 421 datasets (1.4) »
|409||NII Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection||We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 1...||action detection aerial view uav drone pedestrian multi-human tracking||link||2017-09-20||81|
|408||PETS 2016 IPATCH dataset||The PETS 2016 IPATCH dataset contains a set of fourteen multi camera recordings (visible, themal) collected off the coast of Brest, France, in collaboration wit...||maritime vessel boat detection tracking thermal visible gps radar multimodal||link||2017-09-16||79|
|347||MOCAT (TUB Multi-Object and Multi-Camera Tracking Dataset)||The TU Berlin Multi-Object and Multi-Camera Tracking Dataset (MOCAT) is a synthetic dataset to train and test tracking and detection systems in a virtual world....||synthetic tracking detection multi-class multi-view evaluation pedestrian vehicle animal||link||2016-11-02||575|
|288||Berkeley Urban Street tracking||The UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca...||tracking detection segmentation multitarget recognition video pedestrian urban human||link||2015-07-14||1200|
|286||HDA Person Dataset - ISR Lisbon||The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De...||Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human||link||2017-10-02||1769|
|216||CVC Partial Occlusion Virtual Pedestrian||The CVC Partial Occlusion Virtual Pedestrian datasets (CVC-01 to CVC-06) cover a range of scenarios of occluded pedestrians generated in a virtual and real envi...||detection classification tracking pedestrian synthetic urban occlusion||link||2016-03-15||1331|
|210||Traffic Video dataset||The Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be downloaded ...||urban traffic tracking detection overhead view road video||link||2014-02-03||2778|
|201||50 Salads||The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities correspo...||action activity recognition classification detection tracking video||link||2013-10-05||902|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2016-09-18||1428|
|169||QMUL Junction Dataset||The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra...||detection tracking crowd counting pedestrian video motion behavior||link||2016-12-06||1395|
|168||Mall Dataset||The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians were label...||detection tracking crowd counting pedestrian indoor video webcam||link||2016-12-06||1695|
|166||ICG Multi-Camera Datasets||The ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowded sc...||multiview pedestrian tracking detection object camera calibration graz indoor video multitarget||link||2015-06-19||1208|
|165||ICG Multi-Camera and Virtual PTZ||The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph...||multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget||link||2017-08-19||1313|
|164||ICG Lab 6 (Multi-Camera Multi-Object Tracking)||The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came...||multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz||link||2017-12-05||1814|
|151||People in WBCN||This dataset is for people tracking in wide baseline camera networks and was designed as a contest at ICPR 2012. The contest consists of two challenges: ...||detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial||link||2013-08-02||1313|
|150||SDHA Contest||The Semantic Description of Human Activities (SDHA) was a contest at ICPR 2010. The contest is composed of three different types of activity recognition cha...||detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial||link||2013-07-31||966|
|107||BIWI Pedestrians||We provide the three datasets used for testing our system for our ICCV 2007 publication, including annotations. Data was recorded using a pair of AVT Marlins mo...||detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion||link||2013-03-12||1160|
|106||BIWI Walking Pedestrians (EWAP)||The BIWI Walking Pedestrians (EWAP) dataset shows walking pedestrians in busy scenarios from a bird eye view. Manually annotated. Data used for training in our ...||detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial||link||2013-08-02||1690|
|68||The KITTI Vision Benchmark Suite||We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste...||stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth||link||2017-11-26||1361|
|60||PSU HUB||The PSU HUB dataset is a detection, tracking dataset. Ground truth trajectory and grouping information for pedestrians walking in the PSU student union building...||detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion||link||2013-07-19||1062|
|16||PETS 2009||The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for person coun...||frontview, outdoor, pedestrian, detection, tracking, overlap, occlusion multitarget, human||link||2015-06-19||1405|
|15||PETS 2006||The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train station scen...||frontview, indoor, pedestrian, detection, tracking, multitarget||link||2015-08-12||1123|
|9||TUD Crossing tracking||The TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with significant va...||tracking detection segmentation multitarget pedestrian sideview overlap urban||link||2015-06-19||1937|