did=217 task=did=217 YACVID - Youtube-Objects dataset - Details

Yet Another Computer Vision Index To Datasets (YACVID) - Details

Stand: 2024-05-11 05:19:24 - Overview

Attribute	Current content	New content
Name (Institute + Shorttitle)	Youtube-Objects dataset
Description (include details on usage, files and paper references)	The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for each class. The duration of each video varies between 30 seconds and 3 minutes. The videos are weakly annotated, i.e. we ensure that each video contains at one object of the corresponding class. In addition to the videos, this release also includes several materials from our paper [1] Bounding-boxes annotations. For evaluation purposes we annotated the object location in a few hundred video frames for each class (see sec. 6.1 [1]). Point tracks and motion segments. As produced by [2]. Tubes. Spatio-temporal bounding-boxes as described in section 3.2 [1]. We include all candidate tubes (yellow in the fig. above) as well as the tube automatically selected by our method (blue). [1] A. Prest, C. Leistner, J. Civera, C. Schmid and V. Ferrari. Learning Object Class Detectors fromWeakly Annotated Video Computer Vision and Pattern Recognition (CVPR), 2012.	The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for each class. The duration of each video varies between 30 seconds and 3 minutes. The videos are weakly annotated, i.e. we ensure that each video contains at one object of the corresponding class. In addition to the videos, this release also includes several materials from our paper [1] Bounding-boxes annotations. For evaluation purposes we annotated the object location in a few hundred video frames for each class (see sec. 6.1 [1]). Point tracks and motion segments. As produced by [2]. Tubes. Spatio-temporal bounding-boxes as described in section 3.2 [1]. We include all candidate tubes (yellow in the fig. above) as well as the tube automatically selected by our method (blue). [1] A. Prest, C. Leistner, J. Civera, C. Schmid and V. Ferrari. Learning Object Class Detectors fromWeakly Annotated Video Computer Vision and Pattern Recognition (CVPR), 2012.
URL Link	http://people.ee.ethz.ch/~presta/youtube-objects/website/youtube-objects.html
Files (#)	200
References (SKIPPED)	0
Category (SKIPPED)
Tags (single words, spaced)	video object detection segmentation flow optical
Last Changed	2024-05-11
Turing (2.12+3.25=?) :-)