This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 505 tags of 505 total tags for 366 datasets (1.38) »
|308||TST Intake Monitoring dataBase||t is composed of food intake movements, recorded with Kinect V1 (320×240 depth frame resolution), simulated by 35 volunteers for a total of 48 tests. The device...||human food intake monitoring behavior kinect pointcloud tracking age groundtruth||link||2016-02-11||300|
|305||SPHERE human skeleton movements||The SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of the clas...||human action behavior motion movement video skeleton depth kinect||link||2016-03-24||494|
|294||Happy People Images Database||Group emotion recognition in images - Happiness Intensity labels for group of people in images. The images have been collected from Flickr using keyword search ...||group, facial expression, emotion, wild, human, flickr, behavior||link||2015-07-13||503|
|235||Kindergarten Video Surveillance||The dataset consist of the about 50 hours obtained from kindergarten surveillance videos. Dataset, totally approximately 100 videos sequences (1000GB, 50 hours)...||human action behavior segmentation video background surveillance||link||2015-10-08||926|
|173||MuHAVi and MAS human action||The Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for the eval...||human action behavior segmentation video background||link||2013-08-12||1146|
|169||QMUL Junction Dataset||The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra...||detection tracking crowd counting pedestrian video motion behavior||link||2016-12-06||1034|