Yet Another Computer Vision Index To Datasets (YACVID) - Details

As of: 2020-07-05 000000m 08:20:37 - Overview

Attribute Current Content New
Name (Institute + Shorttitle) human3.6m
Description (include details on usage, files and paper references) The Human3.6M dataset is one of the largest datasets
for 3D human pose estimation. It consists of 3.6 million
images featuring 11 actors performing 15 daily activities,
such as eating, sitting, walking and taking a photo, from
4 camera views. The ground-truth 3D poses are captured
by a motion capture (mocap) system, while the 2D poses can be obtained
by projection with the known intrinsic and extrinsic camera
parameters.
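
The projection from 3D to 2D poses mentioned above can be sketched roughly as follows. This is a minimal pinhole-camera illustration, not the dataset's own code: the function name `project_point`, the convention X_cam = R * X_world + t, and all numeric values are assumptions chosen for the example. The dataset's calibration files may use a different convention and may include lens distortion terms, which are ignored here.

```python
def project_point(point_world, rotation, translation, focal, center):
    """Project one 3D world point to 2D pixel coordinates (pinhole model, no distortion)."""
    # Extrinsics (assumed convention): X_cam = R * X_world + t
    x_cam = [
        sum(rotation[i][j] * point_world[j] for j in range(3)) + translation[i]
        for i in range(3)
    ]
    if x_cam[2] <= 0:
        raise ValueError("point is behind the camera")
    # Intrinsics: perspective divide, scale by focal length, shift by principal point
    u = focal[0] * x_cam[0] / x_cam[2] + center[0]
    v = focal[1] * x_cam[1] / x_cam[2] + center[1]
    return (u, v)


# Hypothetical camera: identity rotation, zero translation, made-up intrinsics
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 0.0]
focal = (1000.0, 1000.0)   # focal lengths in pixels (illustrative)
center = (500.0, 500.0)    # principal point in pixels (illustrative)

joint_3d = [0.5, -0.25, 5.0]  # one joint position, in the same units as t
joint_2d = project_point(joint_3d, R, t, focal, center)
print(joint_2d)
```

Applying the same function to every joint of a pose, once per camera, would yield the 2D pose for each of the 4 views.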

Diversity and Size
3.6 million 3D human poses and corresponding images

11 professional actors (6 male, 5 female)

17 scenarios (discussion, smoking, taking photo, talking on the phone...)

Accurate Capture and Synchronization
High-resolution 50Hz video from 4 calibrated cameras

Accurate 3D joint positions and joint angles from a high-speed motion capture system

Pixel-level labels for 24 body parts for each configuration

Time-of-flight range data

3D laser scans of the actors

Accurate background subtraction, person bounding boxes

Support for Development
Precomputed image descriptors

Software for visualization and discriminative human pose prediction

Performance evaluation on withheld test set
URL Link 
Files (#) 3600000
References (SKIPPED)
Category (SKIPPED) 
Tags (single words, spaced) human pose estimation camera video 3d laser scan action actor body part mocap
Last Changed 2020-07-05