This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2018. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on? Add a new dataset! Yay!
«showing 697 tags of 697 total tags for 511 datasets (1.36) »
|491||Speech-driven 3D Facial Motion Database (S3DFM)||ANN: dynamic 2D/3D speaking face dataset with synchronized audio We would like to announce a new facial biometric dataset that has: * 1 second of 500 frame ...||speech motion face 3d recognition speaker||link||2019-03-09||313|
|426||UCL Motion Model Selection Dataset||The UCL Motion Model Selection Dataset contains videos in avi format, compressed with HuffYUV. They are separated into folders according to manual inspection-ba...||motion model real-world youtube video||link||2018-01-10||431|
|387||Edinburgh Ceilidh Overhead Video Data||This web page contains video data and ground truth for 16 dances with two different dance patterns. The style of dancing is inspired by Scottish Ceilidh dancing...||video dance chemistry pattern background motion analysis action||link||2017-07-02||492|
|380||CERTH Image Blur Dataset||The CERTH image blur dataset consists of 2450 digital images, 1850 out of which are photographs captured by various camera models in different shooting conditio...||blur motion defocus detection quality image||link||2020-01-14||1127|
|371||ICS-FORTH MHAD101 Action Co-segmentation||This is a custom generated dataset designed for the task of action co-segmentation in pairs of action sequences. The dataset contains 101 pairs of action se...||action co-segmentation, temporal segmentation, motion capture data, time series||link||2018-03-22||716|
|349||HKUST Ambiguity Dataset||This dataset contains two image collections, TempleOfHeaven and SportsArena, that are deemed hard for Structure-from-Motion (SfM). The method is described i...||Structure Motion, Ambiguous structure sfm||link||2018-06-22||1001|
|346||LASIESTA (Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms)||LASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving object det...||dataset groundtruth motion object detection foreground background subtraction challenge stationary camera||link||2017-09-12||975|
|305||SPHERE human skeleton movements||The SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of the clas...||human action behavior motion movement video skeleton depth kinect||link||2016-03-24||1253|
|299||CAMP-TUM: Multiple Human Pose Estimation from Multiple Views||We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from CVLAB@...||3D human pose estimation multiple view motion capture||link||2015-07-15||1495|
|298||Freiburg-Berkeley Motion Segmentation||The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is anno...||video segmentation benchmark object tracking pedestrian groundtruth motion||link||2017-03-21||1725|
|296||Video Segmentation Benchmark||The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divided into...||video segmentation benchmark object tracking pedestrian groundtruth motion||link||2018-11-12||2121|
|269||Daimler Urban Segmentation Dataset||The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs with a r...||semantic segmentation outdoor urban stereo motion||link||2015-06-26||2088|
|249||Image Sequence Analysis Test Site (EISATS)||The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as...||stereo vision optical flow motion analysis semantic segmentation||link||2014-09-30||1661|
|241||Malaya Abrupt Motion (MAMo) Dataset||The Malaya Abrupt Motion (MAMo) dataset is targeted for visual tracking, particularly for abrupt motion tracking. It was collected from publicly accessible data...||visual tracking, abrupt motion tracking||link||2016-11-05||1526|
|234||UMD Dynamic Scene Recognition||The UMD Dynamic Scene Recognition dataset consists of 13 classes and 10 videos per class and is used to classify dynamic scenes. The dataset has been describ...||scene recognition classification dynamic video motion||link||2017-01-05||1421|
|226||Fish4Knowledge||The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and extracted...||classification animal fish video motion nature recognition water camera||link||2014-05-15||1695|
|219||JPL First-Person Interaction||JPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset particularl...||video action recognition interactive motion human||link||2014-02-03||1043|
|207||CASIA Gait Recognition Dataset||Dataset A (former NLPR Gait Database) was created on Dec. 10, 2001, including 20 persons. Each person has 12 image sequences, 4 sequences for each of the three ...||gait recognition biometry action classification motion human foot pressure||link||2017-03-10||3640|
|204||UCF Person and Car VideoSeg||The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm...||video segmentation object motion model camera groundtruth||link||2015-04-19||1552|
|203||GaTech VideoSeg||The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat...||video segmentation object motion model camera||link||2013-10-09||1676|
|202||GaTech SegTrack||The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura...||video segmentation object proposal flow optical motion model camera stationary groundtruth||link||2013-10-09||1458|
|180||Airport MotionSeg||The Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is challenging b...||motion segmentation airport video clustering camera zoom||link||2013-09-04||1546|
|169||QMUL Junction Dataset||The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra...||detection tracking crowd counting pedestrian video motion behavior||link||2016-12-06||2167|
|157||Background Models Challenge (BMC)||Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -...||background modeling change motion detection surveillance video segmentation||link||2019-08-15||2650|
|141||Berkeley Multimodal Human Action Database (MHAD)||The Berkeley Multimodal Human Action Database (MHAD) contains 11 actions performed by 7 male and 5 female subjects in the range 23-30 years of age except for on...||action classification multiview motion recognition||link||2014-02-03||1498|
|113||Penn-Fudan Pedestrian||Penn-Fudan Pedestrian Detection and Segmentation...||pedestrian detection segmentation background motion||link||2013-08-08||1610|
|80||Hopkins 155||The Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It contains...||flow, stereo, motion, segmentation, urban||link||2015-04-01||1908|