This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also has a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
|305||SPHERE human skeleton movements||The SPHERE human skeleton movements dataset was created using a Kinect camera that measures distances and provides a depth map of the scene instead of the clas...||human action behavior motion movement video skeleton depth kinect||link||2016-03-24||489|
|299||CAMP-TUM: Multiple Human Pose Estimation from Multiple Views||We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from CVLAB@...||3D human pose estimation multiple view motion capture||link||2015-07-15||402|
|298||Freiburg-Berkeley Motion Segmentation||The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is anno...||video segmentation benchmark object tracking pedestrian groundtruth motion||link||2017-03-21||653|
|296||Video Segmentation Benchmark||The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divided into...||video segmentation benchmark object tracking pedestrian groundtruth motion||link||2017-03-21||740|
|269||Daimler Urban Segmentation Dataset||The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs with a r...||semantic segmentation outdoor urban stereo motion||link||2015-06-26||789|
|249||Image Sequence Analysis Test Site (EISATS)||The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as...||stereo vision optical flow motion analysis semantic segmentation||link||2014-09-30||809|
|241||Malaya Abrupt Motion (MAMo) Dataset||The Malaya Abrupt Motion (MAMo) dataset is targeted for visual tracking, particularly for abrupt motion tracking. It was collected from publicly accessible data...||visual tracking, abrupt motion tracking||link||2016-11-05||798|
|234||UMD Dynamic Scene Recognition||The UMD Dynamic Scene Recognition dataset consists of 13 classes and 10 videos per class and is used to classify dynamic scenes. The dataset has been describ...||scene recognition classification dynamic video motion||link||2017-01-05||728|
|226||Fish4Knowledge||The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and extracted...||classification animal fish video motion nature recognition water camera||link||2014-05-15||741|
|219||JPL First-Person Interaction||JPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset particularl...||video action recognition interactive motion human||link||2014-02-03||515|
|207||CASIA Gait Recognition Dataset||Dataset A (former NLPR Gait Database) was created on Dec. 10, 2001, including 20 persons. Each person has 12 image sequences, 4 sequences for each of the three ...||gait recognition biometry action classification motion human foot pressure||link||2017-03-10||1881|
|204||UCF Person and Car VideoSeg||The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm...||video segmentation object motion model camera groundtruth||link||2015-04-19||803|
|203||GaTech VideoSeg||The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat...||video segmentation object motion model camera||link||2013-10-09||663|
|202||GaTech SegTrack||The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura...||video segmentation object proposal flow optical motion model camera stationary groundtruth||link||2013-10-09||624|
|180||Airport MotionSeg||The Airport MotionSeg dataset contains 12 sequences of videos of an airport scenario with small and large moving objects and various speeds. It is challenging b...||motion segmentation airport video clustering camera zoom||link||2013-09-04||678|
|169||QMUL Junction Dataset||The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra...||detection tracking crowd counting pedestrian video motion behavior||link||2016-12-06||1025|
|157||Background Models Challenge (BMC)||Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -...||background modeling change motion detection surveillance video segmentation||link||2016-02-24||1096|
|141||Berkeley Multimodal Human Action Database (MHAD)||The Berkeley Multimodal Human Action Database (MHAD) contains 11 actions performed by 7 male and 5 female subjects in the range 23-30 years of age except for on...||action classification multiview motion recognition||link||2014-02-03||695|
|113||Penn-Fudan Pedestrian||Penn-Fudan Pedestrian Detection and Segmentation...||pedestrian detection segmentation background motion||link||2013-08-08||639|
|80||Hopkins 155||The Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It contains...||flow, stereo, motion, segmentation, urban||link||2015-04-01||780|