|Description (include details on usage, files and paper references)||The NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect. The dataset has several components:
Labeled: A subset of the video data accompanied by dense multi-class labels. This data has also been preprocessed to fill in missing depth labels.
Raw: The raw rgb, depth and accelerometer data as provided by the Kinect.
Toolbox: Useful functions for manipulating the data and labels.
The train/test splits used for evaluation.
64 different indoor scenes
7 scene types
108,617 unlabeled frames
2347 densely labeled frames
Inpainted and raw depth available