This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to the copyright of their respective authors.
Showing 665 of 665 tags for 470 datasets (1.41).
|410||Charades Activity Dataset||10,000 30sec videos from 267 volunteers, each annotated with multiple activities, captions, objects, and temporal localizations. From "Hollywood in Homes: Cr...||video activity recognition action object caption localization detection human daily||link||2018-03-22||328|
|327||PIROPO Database: People in Indoor ROoms with Perspective and Omnidirectional cameras||The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor rooms, u...||people surveillance perspective omnidirectional fisheye indoor room detection human||link||2017-02-16||1186|
|288||Berkeley Urban Street tracking||The UrbanStreet dataset used in the paper can be downloaded here [188M]. It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca...||tracking detection segmentation multitarget recognition video pedestrian urban human||link||2015-07-14||1510|
|286||HDA Person Dataset - ISR Lisbon||The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De...||Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human||link||2017-10-02||2363|
|275||TST fall detection||It is composed of ADL (activities of daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, with diff...||action recognition detection depth kinect wearable accelerometer human video||link||2017-03-14||1158|
|272||Stanford 40 Actions||The Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performing the ac...||human action recognition detection boundingbox||link||2015-06-19||1259|
|263||Crowd Dataset||The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense crowd in t...||crowd video detection anomaly scene understanding human pedestrian||link||2017-09-19||1979|
|257||FaceScrub||The FaceScrub dataset comprises a total of 107818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per person. M...||face detection recognition celebrity people human||link||2018-06-30||1192|
|254||ChokePoint Dataset||We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e...||human pedestrian identification recognition multiview sequence face detection real world surveillance clustering||link||2015-05-02||1529|
|247||PASCAL VOC Parts||The PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask for 2...||detection recognition pascal object part pedestrian human segmentation semantic||link||2014-09-30||1521|
|232||Pratheepan Human Skin Detection Dataset||The images in this dataset are downloaded randomly from Google for human skin detection research. It has been used in the paper: W.R. Tan, C.S. Chan, Y. Prathee...||skin detection, skin segmentation, human detection, skin dataset||link||2018-08-06||3686|
|227||Omnidirectional and panoramic image dataset||We share our omnidirectional and panoramic image dataset (with annotations) to be used for human and car detection. Please reach through: http://cvrg.iyte.edu....||panorama detection car omnidirection human recognition||link||2017-01-13||1685|
|213||ChairGest Gestures||ChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 Xsens Ine...||benchmark recognition kinect gesture detection human||link||2014-06-06||876|
|138||Buffy||The Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for training...||segmentation, detection, buffy, movie, human||link||2015-02-07||906|
|16||PETS 2009||The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for person coun...||frontview, outdoor, pedestrian, detection, tracking, overlap, occlusion multitarget, human||link||2015-06-19||1706|
|14||INRIA People||The INRIA People dataset from Navneet Dalal and Bill Triggs [DalalCVPR2005] consists of training and testing data. The training contains 1805 images and X peopl...||detection, pedestrian, sideview, frontview, human, boundingbox||link||2015-06-19||1677|