Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   activity   address   adhead   adjustment   aerial   aesthetic   aesthetics   age   aic   aircraft   airplane   airport   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulated   aspect   attention   attribute   attributes   authentication   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   calibration   caltech   camera   canada   captioning   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenges;   change   chest   chromaticity   church   circle   cities   city   classification   clustering   clutter   cnn   co-segmentation   coco   code   codebook   coffee   color   community   comparison   conditions   constancy   context   contour   copyright   counting   cover   cow   crowd   ct   cutting   database;   dataset   dataset;   day   decomposition   deep   deformation   dense   depth   description   descriptor   detail   detection   detection;   dichromatic   disgust   disparity   dogs   domain   driving   dubrovnik   duplicate   dynamic   ear   ecocentric   edge   egocentric   ellipses   emotion   estimation   evaluation   event   expression   eye   facade   face   facial   fear   feature   field   fine-grained   fingerprints   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   foreground   foreground;   fov   frontview   fundus   gait   game   genetic   genome   geography   geometry   geotag   germany   gesture   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   ground-truth;   groundtruth   group   hand   hands   handwritten   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   human   identification   illumination   image   imagenet   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   keyframe   kimia   kinect   label   labeling   laboratory   landmark   lane   language   large   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   mask   match   matching   material   medial   medical   memorability   mesh   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motorbike   mouse   movement   movie   moving   mpeg   mug   multi-class   multi-mode   multi-view   multilabel   multiple   multitarget   multiview   natural   nature   navigation   network   neutral   newyork   night   noise   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   optical   optimization   organ   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   panorama   panoramio   paris   parsing   part   partial   pasadena   pascal   patch   path   pedestrian   people   person   perspective   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   popularity   pose   pressure   primitive   procedural   profile   proposal   ptz   quality   radar   randomnoise   rank   ranking   ransac   rate   ratio   re-identification   real   recognition   reconstruction   rectification   rectified   reflection   registration   regular   remote   removal   repetition   retina   retinal   retrieval   rgb   rgbd   road   robot   robust   rome   room   rotation   sad   saliency   sampling   sanfrancisco   scale   scan   scanner   scene   scenes   search   segmentation   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   shapes   sheffield   shoes   shutter   sideview   sign   similarity   single   singletarget   singleview   skeleton   sketch   skin   sky   slam   soccer   social   software   source   spain   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   streetside   streetview   structure   structure-from-motion   structures   study   stuff   stylization   subpixel   subtraction;   summarization   summary   superresolution   supervised   surface   surprise   surveillance   swan   switzerland   symmetry   synthetic   target   taxonomy   text   texture   therapy   thermal   things   time   tiny   tool   tools   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   video   video2gif   videosurveillance   view   viewpoint   vision   visual   volleyball   vt   water   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   world   xray   year   zoom   zurich  
«showing 505 tags of 505 total tags for 366 datasets (1.38) »


segmentation
DID Name Description Tags URL Date Views
359 a a... segmentation n/a 2017-01-19 61
357 udacity self-driving-car At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing and usef... car robot driving autonomous street urban video recognition detection classification segmentation time synthetic link 2017-03-15 178
356 The Oxford RobotCar Dataset The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset captures ... car robot driving autonomous street urban video recognition detection classification segmentation time year link 2017-01-04 151
353 COCO-Stuff COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks like sema... semantic segmentation stuff things COCO captioning annotation groundtruth benchmark link 2017-02-16 161
334 LabelMeFacade The LabelMeFacade dataset contains buildings, windows, sky and a limited number of unlabeled regions (maximally 20% covering of the image). This procedure res... segmentation semantic facade urban rectified recognition link 2016-08-23 264
333 UBC3V Dataset UBC3V is a synthetic dataset for training and evaluation of single or multiview depth-based pose estimation techniques. The nature of the data is similar to the... depth segmentation pose link 2016-08-18 222
330 Cityscapes We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high quality... stereo video urban cities semantic segmentation detection car person pedestrian weakly link 2016-07-19 664
329 Virginia Tech and Arab Academy for Science & Technology (VT-AAST) The VT-AAST Benchmarking Dataset A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques​. A new color face image database for ... face, detection, skin, segmentation, benchmarking, link 2016-07-11 285
315 Geosemantic The Geosemantic is a dataset of object locations from GIS and a query image with metadata. It is used to project the buildings and streets that are in the field... semantic segmentation gps geography supervised gis link 2016-01-07 306
310 FASSEG - FAce Semantic Segmentation The FAce Semantic SEGmentation (FASSEG) dataset contains 70 labeled images for semantic segmentation of faces into 6 categories: skin, hair, eyes, nose, mouth a... face segmentation link 2015-10-05 547
309 Coutour patches The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im... patch image match contour edge lowlevel detection segmentation link 2015-09-29 331
307 HandNet annotated hand dataset The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a... hands articulated segmentation classification detection pose fingertip link 2015-09-07 524
298 Freiburg-Berkeley Motion Segmentation The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is anno... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 657
297 Berkeley Video Segmentation The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test... video segmentation benchmark link 2015-07-14 456
296 Video Segmentation Benchmark The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divided into... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 745
292 Mobile Phone and Webcam Hand Images for Personal Authentication and Identification This work attempts to provide two Hand Images Databases for hand biometrics: one is created using a mobile phone camera of modest quality, which we called mob... mobile webcam hand authentication Identification person biometric shape segmentation link 2015-11-09 419
290 UWO GCO Volume Segmentation The Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc consistenc... medical liver babyface bone abdomen adhead face segmentation binary optimization link 2015-06-19 354
288 Berkeley Urban Street tracking The UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca... tracking detection segmentation multitarget recognition video pedestrian urban human link 2015-07-14 831
283 ISPRS WG III/4 ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further inform... aerial multiview 3d photogrammetry germany canada semantic segmentation urban city recognition benchmark link 2015-06-16 418
281 Tuberculosis image and patient data Permanently growing database on lung tuberculosis patients. The data include radiological images (CT+XRay) plus social, clinical, and lab data as well as full g... chest xray CT tuberculosis genome medical segmentation link 2016-08-06 528
269 Daimler Urban Segmentation Dataset The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs with a r... semantic segmentation outdoor urban stereo motion link 2015-06-26 797
266 Paris Art Deco Facades The Paris Art Deco Facades dataset consists of 79 / 80 images of rectified facades of the architectural style Art Deco, which has different sizes of windows, de... paris semantic segmentation recognition architecture facade urban city procedural grammar link 2015-01-20 464
251 ETHZ CVL RueMonge 2014 This ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] and p... semantic segmentation 3d reconstruction architecture paris benchmark source code urban recognition classification outdoor pointcloud mesh link 2014-11-24 1007
249 Image Sequence Analysis Test Site (EISATS) The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as... stereo vision optical flow motion analysis semantic segmentation link 2014-09-30 814
247 PASCAL VOC Parts The PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask for 2... detection recognition pascal object part pedestrian human segmentation semantic link 2014-09-30 935
244 Pedestrian Parsing on Surveillance Scenes (PPSS) dataset The Pedestrian Parsing dataset contains 3,673 images from 171 videos of different Surveillance Scenes (PPSS), where 2,064 images are occluded and 1,609 are not.... Pedestrian, Parsing, Segmentation link 2017-03-21 1218
240 Microsoft COCO The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features: Mo... object context segmentation detection recognition benchmark semantic link 2015-05-02 1124
235 Kindergarten Video Surveillance The dataset consist of the about 50 hours obtained from kindergarten surveillance videos. Dataset, totally approximately 100 videos sequences (1000GB, 50 hours)... human action behavior segmentation video background surveillance link 2015-10-08 921
233 PASCAL Context We would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. In the cu... semantic segmentation pascal benchmark category recognition dense shape link 2014-07-17 680
232 Pratheepan Human Skin Detection Dataset The images in this dataset are downloaded randomly from Google for human skin detection research. It has been used in the paper: W.R. Tan, C.S. Chan, Y. Prathee... skin detection, skin segmentation, human detection, skin dataset link 2016-11-05 1566
229 Paris Rue Madame Paris-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone contains ap... semantic segmentation pointcloud 3d laser classification link 2014-06-10 557
228 MPI VehicleScenes Abstract Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. In thi... semantic segmentation scene understanding classification 3d car pedestrian link 2014-06-10 876
220 3D Mask Attack Dataset The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re... 3d biometry face recognition segmentation frontview emotion link 2016-03-14 716
217 Youtube-Objects dataset The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for... video object detection segmentation flow optical link 2014-02-03 764
212 Polo Instance Segmentation The Polo instance segmentation dataset is a semantic segmentation task for Hough transform based segmentation masks. It consists of supervised segmentation for ... semantic segmentation horse human outdoor mask scene understanding n/a 2016-01-21 613
206 GaTech VideoContext The GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context evaluation i... video geometry context classification semantic segmentation unsupervised supervised outdoor urban nature link 2014-04-06 678
204 UCF Person and Car VideoSeg The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm... video segmentation object motion model camera groundtruth link 2015-04-19 808
203 GaTech VideoSeg The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat... video segmentation object motion model camera link 2013-10-09 669
202 GaTech SegTrack The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura... video segmentation object proposal flow optical motion model camera stationary groundtruth link 2013-10-09 629
198 THUS10000 The THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotated with p... segmentation saliency object detection visual attention link 2015-01-11 846
197 Stanford Background Dataset The Stanford Background Dataset is a new dataset introduced in Gould et al. (ICCV 2009) for evaluating methods for geometric and semantic scene understanding. T... semantic segmentation urban classification nature geometry link 2016-01-21 1197
195 Yotta The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driving. ... semantic segmentation urban video camera 3d reconstruction classification link 2013-09-30 682
185 Kung-Fu fighter Multi-View The test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for testing... multiview tracking segmentation camera action link 2013-10-08 661
180 Airport MotionSeg The Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is challenging b... motion segmentation airport video clustering camera zoom link 2013-09-04 685
179 CMP Facades The CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from various sou... facade rectification urban semantic classification recognition structure similarity segmentation link 2015-06-19 542
177 SIPI textures The Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenthesis (i... texture, segmentation, classification, benchmark, synthetic, evaluation link 2013-08-20 696
176 Brodatz Album The Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brodatz32.h... texture, segmentation, classification, benchmark, synthetic link 2014-12-23 841
175 Outex texture bench The Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being constructed acc... texture, segmentation, classification, benchmark, synthetic link 2015-11-17 536
173 MuHAVi and MAS human action The Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for the eval... human action behavior segmentation video background link 2013-08-12 1143
172 DynTex dataset The DynTex dataset consists of a comprehensive set of Dynamic Textures. Dynamic, or temporal, texture is a spatially repetitive, time-varying visual pattern tha... texture, segmentation, dynamic, synthetic, video repetition link 2013-08-12 601
171 CHALEARN Multi-modal Gesture Challenge The CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton data. ht... gesture, kinect, recognition, human, action, illumination, depth, segmentation, skeleton link 2013-08-09 620
164 ICG Lab 6 (Multi-Camera Multi-Object Tracking) The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came... multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz link 2013-10-08 1248
157 Background Models Challenge (BMC) Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -... background modeling change motion detection surveillance video segmentation link 2016-02-24 1101
149 NYU Depth v2 The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinec... semantic segmentation depth kinect label reconstruction link 2013-07-25 1092
148 NYU Depth v1 The NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect. ... semantic segmentation depth kinect label reconstruction link 2014-10-05 709
138 Buffy The Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for training... segmentation, detection, buffy, movie, human link 2015-02-07 546
136 3D Object in Clutter Recognition and Segmentation The dataset is composed of 150 synthetic scenes, captured with a (perspective) virtual camera, and each scene contains 3 to 5 objects. The model set is composed... recognition, segmentation, mesh, synthetic link 2013-08-08 710
123 CMU/VMR Urban Image+Laser CMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner being turne... reconstruction, sfm, urban, semantic, segmentation, laser link 2013-04-02 788
121 Oakland 3D This repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purposes. ... reconstruction, sfm, urban, semantic, segmentation, laser link 2014-06-10 744
113 Penn-Fudan Pedestrian Penn-Fudan Pedestrian Detection and Segmentation... pedestrian detection segmentation background motion link 2013-08-08 645
112 SHREC Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of evalua... segmentation, mesh, part, 3d link 2013-07-29 475
111 Grabcut To evaluate our method we designed a new ground truth database of 50 images. The following zip-files contain: Data, Segmentation, Labelling - Lasso, Labelling -... segmentation, boundingbox, color, optimization, background link 2015-06-19 489
105 MSR 3D Video These sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitnick, ... reconstruction, camera, segmentation, depth link 2013-03-12 669
100 Sowerby The Sowerby dataset contains 105 images for semantic segmentation.... semantic, segmentation, outdoor n/a 2014-09-26 675
99 BSDS500 This new dataset is an extension of the BSDS300, where the original 300 images are used for training / validation and 200 fresh images, together with human anno... segmentation, edge, contour, detection link 2013-03-12 665
98 BSDS300 The goal of this work is to provide an empirical basis for research on image segmentation and boundary detection. To this end, we have collected 12,000 hand-la... segmentation, edge, contour, detection link 2013-03-12 674
90 eTrims The eTrims dataset is comprised of two datasets, the 4-Class eTRIMS Dataset with 4 annotated object classes and the 8-Class eTRIMS Dataset with 8 annotated obje... semantic, segmentation, urban, reconstruction link 2013-03-12 521
89 Corel Photo Gallery This image database is a part of the "Corel Gallery Magic" (commercial product). It contains 80000 images divided into 800 categories of 100 images. These image... semantic, segmentation, outdoor n/a 2017-01-19 560
87 Simpsons 40 years Simpsons Homer 40 years is a dataset showing Homer Simpson over the course of 40 years. It is used for video segmentation and shape matching between frames.... video, segmentation, shape, matching n/a 2016-04-19 593
86 ICG Graz240 The ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult due to ... segmentation, detection, semantic, urban, graz link 2016-03-29 687
81 Zurich Hoengg Zurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the format is Ti... aerial, semantic, segmentation, outdoor link 2013-03-11 607
80 Hopkins 155 The Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It contains... flow, stereo, motion, segmentation, urban link 2015-04-01 784
79 LabelMe The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by visitin... segmentation, semantic, outdoor, detection, urban, software link 2013-03-14 597
75 ETHZ Shape The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-class ... shape, detection, matching, segmentation, clutter, applelogo, bottle, giraffe, nature, swan, mug link 2014-02-11 622
68 The KITTI Vision Benchmark Suite We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste... stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth link 2014-02-10 1008
62 Deformed Lattice Detection The Deformed Lattice Detection In Real-World Images dataset is used for regular grid detection. The authors have developed a robust and fast lattice detection a... texture, segmentation, symmetry, lattice, detection, urban link 2013-03-11 584
59 Near-Regular Textures The Near-Regular Textures dataset contains textures from completely regular to completely irregular patterns, with a focus on near-regular textures. It also inc... texture, segmentation, classification, symmetry, regular, stochastic link 2013-03-11 557
58 INRIA Horses The INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and cluttered ba... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 525
57 Weizmann Horses The multi-scale Weizmann horses (originally from Eran Borenstein, adapted by Jamie Shotton) consists of 656 images which is split into 50+50training, 50+50 vali... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 842
56 ETHZ Extended Shape The ETHZ Extended Shape classes dataset from Konrad Schindler is larger dataset of shape categories, created by merging ETHZ shape classes with Konrad Schindler... detection, shape, segmentation, clutter link 2013-03-11 537
55 Prague Texture Segmentation The Prague Texture Segmentation Datagenerator and Benchmark is designed to mutually compare and rank different (dynamic/static) texture segmenters (supervised o... texture, segmentation, classification, benchmark, synthetic link 2013-08-08 526
42 Hollywood Videos Hollywood-2 datset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video in t... action, classification, video, segmentation link 2013-03-12 827
41 KTH Action The current video database containing six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several times by 2... action, classification, video, segmentation link 2013-03-12 538
40 Weizmann Action The Weizmann actions dataset by Blank, Gorelick, Shechtman, Irani, and Basri consists of ten different types of actions: bending, jumping jack, jumping, jump in... video, segmentation, action, classification link 2015-07-14 572
39 Leuven Stereo Scene The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detection and... segmentation, semantic, reconstruction, urban, sfm, 3d, leuven, depth, stereo link 2013-11-03 1373
38 IcgBench The Interactive Segmentation (IcgBench) dataset from Jakob Santner contains 243 images and 262 segmentation. Some images have multiple segmentations. The annota... interactive, segmentation, user link 2013-03-11 490
37 MSRC vNIPS The MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Connected... segmentation, semantic, outdoor link 2013-03-11 530
36 MSRC v2 The MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accurate pixe... segmentation, semantic, outdoor link 2016-08-28 1471
35 MSRC v1 The MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is commonl... segmentation, semantic, outdoor link 2016-09-07 1155
34 CamVid The Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantically labe... sfm, depth, semantic, segmentation, urban link 2016-04-18 1796
33 ECP New York 2011 The ECP New York dataset contains 10 manually segmented buildings from New York City, USA. Segmentation evaluating using Dice coefficient is calculated for the ... segmentation, semantic, procedural, reconstruction, urban, newyork link 2013-08-08 491
32 ECP Paris 2011 The ECP Paris 2011 dataset consists of 104 images taken from rue Monge in the fifth district of Paris, we kept only 20 for training and 10 for testing. Howev... segmentation, semantic, procedural, reconstruction, urban, paris link 2013-08-08 517
31 ECP Paris 2010 The Ecole Centrale Paris 2010 (Paris 2010) dataset consists of 30 images of densely annotated building facades in seven classes - wall, window, sky, shop, balco... segmentation, semantic, procedural, reconstruction, urban, paris link 2013-03-11 574
30 ICG Graz50 This is a dataset of rectified facade images and semantic labels. The goal of the annotation is to study the layout of the facades. It contains 50 images of... segmentation, semantic, procedural, reconstruction, urban, graz link 2014-01-28 630
25 PASCAL VOCs The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late... detection segmentation pose pedestrian chair animal car building airplane link 2017-03-09 781
21 ImageNET The ImageNET dataset is the latest dataset by Li Fei-Fei containing various dataset ranging from 1000 to 10000 categories.... retrieval, segmentation, classification link 2013-03-11 668
18 Leeds Cows The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of different backg... detection segmentation cow video background animal link 2013-08-08 665
12 TUD Pedestrians training The TUD Pedestrians training dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 210 and 400 training images with X pedestrians with signifi... segmentation, pedestrian, sideview link 2013-03-11 1063
11 TUD Campus The TUD Campus dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 71 images and 303 highly overlapping pedestrians with large scale changes... segmentation, pedestrian, sideview, overlap link 2013-03-11 924
10 TUD Pedestrians The TUD Pedestrians dataset from Micha Andriluka, Stefan Roth and Bernt Schiele [AndrilukaCVPR2008] consists of 250 images with 311 fully visible people with si... segmentation, pedestrian, sideview link 2015-05-26 1217
9 TUD Crossing tracking The TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with significant va... tracking detection segmentation multitarget pedestrian sideview overlap urban link 2015-06-19 1357


total views: 71407 5 queries in 2.6941299438477E-5s 2.0027160644531E-5s 8.082389831543E-5s 2.2172927856445E-5s 0.0014879703521729s and total 0.0069711208343506s