Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   activity   address   adhead   adjustment   aerial   aesthetic   aesthetics   age   aic   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulated   aspect   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   blur   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   caltech   camera   canada   captioning   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chromaticity   church   circle   cities   city   classification   clothing   clustering   clutter   cnn   co-segmentation   coco   code   codebook   coffee   color   community   comparison   conditions   constancy   context   contour   cooking   copyright   cosegmentation   counting   cover   cow   crepe   cross-view   crowd   ct   cutting   dance   data   dataset   day   decomposition   deep   defocus   deformation   dense   depth   description   descriptor   detail   detection   dichromatic   disgust   disparity   dogs   domain   driving   dubrovnik   duplicate   dynamic   ear   ecocentric   edge   egocentric   ellipses   emotion   endtoend   estimation   evaluation   event   expression   eye   facade   face   facial   fear   feature   field   fine-grained   fingerprints   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   foreground   fov   frames   frontview   fundus   gait   game   gaze   gender   genetic   genome   geography   geometry   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   groundtruth   group   hand   hands   handwritten   hd   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   human   identification   illumination   image   imagenet   images   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   kernels   keyframe   kimia   kinect   label   labeling   laboratory   landmark   lane   language   large   large-scale   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motion-capture-data   motorbike   mouse   movement   movie   mpeg   mug   multi-camera   multi-class   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multiple   multitarget   multiview   naming   natural   nature   navigation   network   neutral   newyork   night   noise   normal   nude   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   people   person   perspective   phase   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   polygon   popularity   pornography   pose   presentation   pressure   primitive   privacy   procedural   profile   proposal   ptz   quality   question   radar   randomnoise   rank   ranking   ransac   rate   ratio   re-identification   real   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   reidentification   remote   removal   rendering   repetition   resolution   retina   retinal   retrieval   rgb   rgbd   road   robot   robust   rome   room   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   scenes   search   segmentation   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   shapes   sheffield   shoes   shots   shutter   sideview   sign   similarity   simultaneous   single   singletarget   singleview   skeleton   sketch   skin   sky   slam   soccer   social   software   source   space   spain   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   streetside   streetview   structure   structure-from-motion   structured   structures   study   stuff   stylization   subpixel   subtraction   summarization   summary   superresolution   supervised   surface   surgery   surprise   surveillance   swan   switzerland   symmetry   synthetic   table   target   taxonomy   temporal   text   texture   texture-less   therapy   thermal   things   time   time-series   tiny   tool   tools   top-view   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   video   videosurveillance   view   viewpoint   vision   visual   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   workflow   world   xray   year   zoom   zurich  
«showing 562 tags of 562 total tags for 411 datasets (1.37) »


segmentation
DID Name Description Tags URL Date Views
398 Osnabruck - Gaze Tracking Data Set Gaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with polygo... segmentation, gaze data, polygon annotation, video, metadata link 2017-09-12 44
396 ADE20k Scene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset could be download from MIT Scene Parsing Benchmark. mages ... segmentation semantic annotation benchmark scene recognition link 2017-08-03 88
395 AWS Public Datasets AWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have required hour... amazon classification deep learning segmentation recognition satellite human biology space image resolution link 2017-07-28 99
394 Matterport 2D-3D-Semantics Data The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. I... 3d panorama semantic segmentation depth normal indoor building reconstruction large-scale link 2017-07-27 71
376 ScanNet ScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and instance-le... scene indoor synthetic cad room layout rendering realism 3d segmentation object recognition link 2017-05-12 182
375 SUNCG: Indoor Scenes The SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset of 3D s... scene indoor synthetic room layout rendering realism 3d segmentation object recognition link 2017-05-02 160
374 SceneNet RGB-D Synthetic Indoor SceneNet RGB-D is dataset comprised of 5 million Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth. It expands the previous work of Scene... scene indoor synthetic robot navigation rendering 3d reconstruction trajectory lighting segmentation slam link 2017-05-02 151
373 DAVIS: Densely Annotated VIdeo Segmentation We present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of other succ... object tracking segmentation video benchmark code hd quality resolution link 2017-08-03 164
372 VOT2016 segmentation The VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of BW image... object tracking segmentation mask annotation visual link 2017-04-17 181
371 ICS-FORTH MHAD101 Action Co-segmentation This is a custom generated dataset designed for the task of action co-segmentation in pairs of action sequences. The dataset contains 101 pairs of action se... action cosegmentation, temporal segmentation, motion-capture-data, time-series link 2017-08-29 138
365 Pedestrian Color Naming Dataset Pedestrian Color Naming (PCN) dataset contains 14,213 images, each of which hand-labeled with color label for each pixel.... Pedestrian, segmentation, color naming link 2017-03-13 410
359 a a... segmentation n/a 2017-01-19 258
357 udacity self-driving-car At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing and usef... car robot driving autonomous street urban video recognition detection classification segmentation time synthetic link 2017-03-15 529
356 The Oxford RobotCar Dataset The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset captures ... car robot driving autonomous street urban video recognition detection classification segmentation time year link 2017-01-04 446
353 COCO-Stuff COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks like sema... semantic segmentation stuff things COCO captioning annotation groundtruth benchmark link 2017-02-16 494
334 LabelMeFacade The LabelMeFacade dataset contains buildings, windows, sky and a limited number of unlabeled regions (maximally 20% covering of the image). This procedure res... segmentation semantic facade urban rectified recognition link 2016-08-23 533
333 UBC3V Dataset UBC3V is a synthetic dataset for training and evaluation of single or multiview depth-based pose estimation techniques. The nature of the data is similar to the... depth segmentation pose link 2016-08-18 445
330 Cityscapes We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high quality... stereo video urban cities semantic segmentation detection car person pedestrian weakly link 2016-07-19 1014
329 Virginia Tech and Arab Academy for Science & Technology (VT-AAST) The VT-AAST Benchmarking Dataset A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques​. A new color face image database for ... face, detection, skin, segmentation, benchmarking, link 2016-07-11 515
315 Geosemantic The Geosemantic is a dataset of object locations from GIS and a query image with metadata. It is used to project the buildings and streets that are in the field... semantic segmentation gps geography supervised gis link 2016-01-07 498
310 FASSEG - FAce Semantic Segmentation The FAce Semantic SEGmentation (FASSEG) repository contains datasets for multi-class semantic face segmentation. The FASSEG repository is composed by two dat... face, segmentation link 2017-04-04 916
309 Coutour patches The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im... patch image match contour edge lowlevel detection segmentation link 2015-09-29 511
307 HandNet annotated hand dataset The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a... hand articulation segmentation classification detection pose fingertip rgbd video link 2017-09-12 878
298 Freiburg-Berkeley Motion Segmentation The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is anno... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 935
297 Berkeley Video Segmentation The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test... video segmentation benchmark link 2015-07-14 668
296 Video Segmentation Benchmark The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divided into... video segmentation benchmark object tracking pedestrian groundtruth motion link 2017-03-21 979
292 Mobile Phone and Webcam Hand Images for Personal Authentication and Identification This work attempts to provide two Hand Images Databases for hand biometrics: one is created using a mobile phone camera of modest quality, which we called mob... mobile webcam hand authentication Identification person biometric shape segmentation link 2015-11-09 600
290 UWO GCO Volume Segmentation The Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc consistenc... medical liver babyface bone abdomen adhead face segmentation binary optimization link 2015-06-19 498
288 Berkeley Urban Street tracking The UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca... tracking detection segmentation multitarget recognition video pedestrian urban human link 2015-07-14 1136
283 ISPRS WG III/4 ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further inform... aerial multiview 3d photogrammetry germany canada semantic segmentation urban city recognition benchmark link 2015-06-16 612
281 Tuberculosis image and patient data Permanently growing database on lung tuberculosis patients. The data include radiological images (CT+XRay) plus social, clinical, and lab data as well as full g... chest xray CT tuberculosis genome medical segmentation link 2016-08-06 763
269 Daimler Urban Segmentation Dataset The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs with a r... semantic segmentation outdoor urban stereo motion link 2015-06-26 1141
266 Paris Art Deco Facades The Paris Art Deco Facades dataset consists of 79 / 80 images of rectified facades of the architectural style Art Deco, which has different sizes of windows, de... paris semantic segmentation recognition architecture facade urban city procedural grammar link 2015-01-20 634
251 ETHZ CVL RueMonge 2014 This ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] and p... semantic segmentation 3d reconstruction architecture paris benchmark source code urban recognition classification outdoor pointcloud mesh link 2014-11-24 1307
249 Image Sequence Analysis Test Site (EISATS) The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as... stereo vision optical flow motion analysis semantic segmentation link 2014-09-30 1008
247 PASCAL VOC Parts The PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask for 2... detection recognition pascal object part pedestrian human segmentation semantic link 2014-09-30 1207
244 Pedestrian Parsing on Surveillance Scenes (PPSS) dataset The Pedestrian Parsing dataset contains 3,673 images from 171 videos of different Surveillance Scenes (PPSS), where 2,064 images are occluded and 1,609 are not.... Pedestrian, Parsing, Segmentation link 2017-03-21 1528
240 Microsoft COCO The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features: Mo... object context segmentation detection recognition benchmark semantic link 2015-05-02 1441
235 Kindergarten Video Surveillance The dataset consist of the about 50 hours obtained from kindergarten surveillance videos. Dataset, totally approximately 100 videos sequences (1000GB, 50 hours)... human action behavior segmentation video background surveillance link 2015-10-08 1259
233 PASCAL Context We would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. In the cu... semantic segmentation pascal benchmark category recognition dense shape link 2014-07-17 890
232 Pratheepan Human Skin Detection Dataset The images in this dataset are downloaded randomly from Google for human skin detection research. It has been used in the paper: W.R. Tan, C.S. Chan, Y. Prathee... skin detection, skin segmentation, human detection, skin dataset link 2017-09-14 2324
229 Paris Rue Madame Paris-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone contains ap... semantic segmentation pointcloud 3d laser classification link 2014-06-10 702
228 MPI VehicleScenes Abstract Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. In thi... semantic segmentation scene understanding classification 3d car pedestrian link 2014-06-10 1079
220 3D Mask Attack Dataset The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re... 3d biometry face recognition segmentation frontview emotion link 2016-03-14 1015
217 Youtube-Objects dataset The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for... video object detection segmentation flow optical link 2014-02-03 1014
212 Polo Instance Segmentation The Polo instance segmentation dataset is a semantic segmentation task for Hough transform based segmentation masks. It consists of supervised segmentation for ... semantic segmentation horse human outdoor mask scene understanding n/a 2016-01-21 913
206 GaTech VideoContext The GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context evaluation i... video geometry context classification semantic segmentation unsupervised supervised outdoor urban nature link 2014-04-06 886
204 UCF Person and Car VideoSeg The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm... video segmentation object motion model camera groundtruth link 2015-04-19 1015
203 GaTech VideoSeg The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat... video segmentation object motion model camera link 2013-10-09 904
202 GaTech SegTrack The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura... video segmentation object proposal flow optical motion model camera stationary groundtruth link 2013-10-09 827
198 THUS10000 The THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotated with p... segmentation saliency object detection visual attention link 2015-01-11 1047
197 Stanford Background Dataset The Stanford Background Dataset is a new dataset introduced in Gould et al. (ICCV 2009) for evaluating methods for geometric and semantic scene understanding. T... semantic segmentation urban classification nature geometry link 2016-01-21 1611
195 Yotta The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driving. ... semantic segmentation urban video camera 3d reconstruction classification link 2013-09-30 891
185 Kung-Fu fighter Multi-View The test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for testing... multiview tracking segmentation camera action link 2013-10-08 896
180 Airport MotionSeg The Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is challenging b... motion segmentation airport video clustering camera zoom link 2013-09-04 929
179 CMP Facades The CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from various sou... facade rectification urban semantic classification recognition structure similarity segmentation link 2015-06-19 717
177 SIPI textures The Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenthesis (i... texture, segmentation, classification, benchmark, synthetic, evaluation link 2013-08-20 889
176 Brodatz Album The Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brodatz32.h... texture, segmentation, classification, benchmark, synthetic link 2014-12-23 1088
175 Outex texture bench The Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being constructed acc... texture, segmentation, classification, benchmark, synthetic link 2015-11-17 705
173 MuHAVi and MAS human action The Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for the eval... human action behavior segmentation video background link 2017-07-25 1581
172 DynTex dataset The DynTex dataset consists of a comprehensive set of Dynamic Textures. Dynamic, or temporal, texture is a spatially repetitive, time-varying visual pattern tha... texture, segmentation, dynamic, synthetic, video repetition link 2013-08-12 785
171 CHALEARN Multi-modal Gesture Challenge The CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton data. ht... gesture, kinect, recognition, human, action, illumination, depth, segmentation, skeleton link 2013-08-09 804
164 ICG Lab 6 (Multi-Camera Multi-Object Tracking) The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came... multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz link 2017-09-18 1702
157 Background Models Challenge (BMC) Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -... background modeling change motion detection surveillance video segmentation link 2016-02-24 1377
149 NYU Depth v2 The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinec... semantic segmentation depth kinect label reconstruction link 2017-06-01 1591
148 NYU Depth v1 The NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect. ... semantic segmentation depth kinect label reconstruction link 2014-10-05 988
138 Buffy The Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for training... segmentation, detection, buffy, movie, human link 2015-02-07 714
136 3D Object in Clutter Recognition and Segmentation The dataset is composed of 150 synthetic scenes, captured with a (perspective) virtual camera, and each scene contains 3 to 5 objects. The model set is composed... recognition, segmentation, mesh, synthetic link 2013-08-08 896
123 CMU/VMR Urban Image+Laser CMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner being turne... reconstruction, sfm, urban, semantic, segmentation, laser link 2013-04-02 983
121 Oakland 3D This repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purposes. ... reconstruction, sfm, urban, semantic, segmentation, laser link 2014-06-10 968
113 Penn-Fudan Pedestrian Penn-Fudan Pedestrian Detection and Segmentation... pedestrian detection segmentation background motion link 2013-08-08 949
112 SHREC Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of evalua... segmentation, mesh, part, 3d link 2013-07-29 648
111 Grabcut To evaluate our method we designed a new ground truth database of 50 images. The following zip-files contain: Data, Segmentation, Labelling - Lasso, Labelling -... segmentation, boundingbox, color, optimization, background link 2015-06-19 664
105 MSR 3D Video These sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitnick, ... reconstruction, camera, segmentation, depth link 2013-03-12 860
100 Sowerby The Sowerby dataset contains 105 images for semantic segmentation.... semantic, segmentation, outdoor n/a 2014-09-26 884
99 BSDS500 This new dataset is an extension of the BSDS300, where the original 300 images are used for training / validation and 200 fresh images, together with human anno... segmentation, edge, contour, detection link 2013-03-12 851
98 BSDS300 The goal of this work is to provide an empirical basis for research on image segmentation and boundary detection. To this end, we have collected 12,000 hand-la... segmentation, edge, contour, detection link 2013-03-12 855
90 eTrims The eTrims dataset is comprised of two datasets, the 4-Class eTRIMS Dataset with 4 annotated object classes and the 8-Class eTRIMS Dataset with 8 annotated obje... semantic, segmentation, urban, reconstruction link 2013-03-12 712
89 Corel Photo Gallery This image database is a part of the "Corel Gallery Magic" (commercial product). It contains 80000 images divided into 800 categories of 100 images. These image... semantic, segmentation, outdoor n/a 2017-01-19 751
87 Simpsons 40 years Simpsons Homer 40 years is a dataset showing Homer Simpson over the course of 40 years. It is used for video segmentation and shape matching between frames.... video, segmentation, shape, matching n/a 2017-07-11 812
86 ICG Graz240 The ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult due to ... segmentation, detection, semantic, urban, graz link 2016-03-29 955
81 Zurich Hoengg Zurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the format is Ti... aerial, semantic, segmentation, outdoor link 2013-03-11 803
80 Hopkins 155 The Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It contains... flow, stereo, motion, segmentation, urban link 2015-04-01 1015
79 LabelMe The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by visitin... segmentation, semantic, outdoor, detection, urban, software link 2013-03-14 804
75 ETHZ Shape The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-class ... shape, detection, matching, segmentation, clutter, applelogo, bottle, giraffe, nature, swan, mug link 2014-02-11 812
68 The KITTI Vision Benchmark Suite We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste... stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth link 2014-02-10 1289
62 Deformed Lattice Detection The Deformed Lattice Detection In Real-World Images dataset is used for regular grid detection. The authors have developed a robust and fast lattice detection a... texture, segmentation, symmetry, lattice, detection, urban link 2013-03-11 763
59 Near-Regular Textures The Near-Regular Textures dataset contains textures from completely regular to completely irregular patterns, with a focus on near-regular textures. It also inc... texture, segmentation, classification, symmetry, regular, stochastic link 2013-03-11 753
58 INRIA Horses The INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and cluttered ba... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 757
57 Weizmann Horses The multi-scale Weizmann horses (originally from Eran Borenstein, adapted by Jamie Shotton) consists of 656 images which is split into 50+50training, 50+50 vali... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 1048
56 ETHZ Extended Shape The ETHZ Extended Shape classes dataset from Konrad Schindler is larger dataset of shape categories, created by merging ETHZ shape classes with Konrad Schindler... detection, shape, segmentation, clutter link 2013-03-11 721
55 Prague Texture Segmentation The Prague Texture Segmentation Datagenerator and Benchmark is designed to mutually compare and rank different (dynamic/static) texture segmenters (supervised o... texture, segmentation, classification, benchmark, synthetic link 2013-08-08 725
42 Hollywood Videos Hollywood-2 datset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video in t... action, classification, video, segmentation link 2013-03-12 1028
41 KTH Action The current video database containing six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several times by 2... action, classification, video, segmentation link 2013-03-12 699
40 Weizmann Action The Weizmann actions dataset by Blank, Gorelick, Shechtman, Irani, and Basri consists of ten different types of actions: bending, jumping jack, jumping, jump in... video, segmentation, action, classification link 2015-07-14 757
39 Leuven Stereo Scene The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detection and... segmentation, semantic, reconstruction, urban, sfm, 3d, leuven, depth, stereo link 2013-11-03 1686
38 IcgBench The Interactive Segmentation (IcgBench) dataset from Jakob Santner contains 243 images and 262 segmentation. Some images have multiple segmentations. The annota... interactive, segmentation, user link 2013-03-11 675
37 MSRC vNIPS The MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Connected... segmentation, semantic, outdoor link 2013-03-11 703
36 MSRC v2 The MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accurate pixe... segmentation, semantic, outdoor link 2016-08-28 1855
35 MSRC v1 The MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is commonl... segmentation, semantic, outdoor link 2016-09-07 1458
34 CamVid The Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantically labe... sfm, depth, semantic, segmentation, urban link 2016-04-18 2618
33 ECP New York 2011 The ECP New York dataset contains 10 manually segmented buildings from New York City, USA. Segmentation evaluating using Dice coefficient is calculated for the ... segmentation, semantic, procedural, reconstruction, urban, newyork link 2013-08-08 651
32 ECP Paris 2011 The ECP Paris 2011 dataset consists of 104 images taken from rue Monge in the fifth district of Paris, we kept only 20 for training and 10 for testing. Howev... segmentation, semantic, procedural, reconstruction, urban, paris link 2013-08-08 665
31 ECP Paris 2010 The Ecole Centrale Paris 2010 (Paris 2010) dataset consists of 30 images of densely annotated building facades in seven classes - wall, window, sky, shop, balco... segmentation, semantic, procedural, reconstruction, urban, paris link 2013-03-11 777
30 ICG Graz50 This is a dataset of rectified facade images and semantic labels. The goal of the annotation is to study the layout of the facades. It contains 50 images of... segmentation, semantic, procedural, reconstruction, urban, graz link 2014-01-28 855
25 PASCAL VOCs The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late... detection segmentation pose pedestrian chair animal car building airplane link 2017-03-09 993
21 ImageNET The ImageNET dataset is the latest dataset by Li Fei-Fei containing various dataset ranging from 1000 to 10000 categories.... retrieval, segmentation, classification link 2013-03-11 927
18 Leeds Cows The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of different backg... detection segmentation cow video background animal link 2013-08-08 897
12 TUD Pedestrians training The TUD Pedestrians training dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 210 and 400 training images with X pedestrians with signifi... segmentation, pedestrian, sideview link 2013-03-11 1413
11 TUD Campus The TUD Campus dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 71 images and 303 highly overlapping pedestrians with large scale changes... segmentation, pedestrian, sideview, overlap link 2013-03-11 1242
10 TUD Pedestrians The TUD Pedestrians dataset from Micha Andriluka, Stefan Roth and Bernt Schiele [AndrilukaCVPR2008] consists of 250 images with 311 fully visible people with si... segmentation, pedestrian, sideview link 2015-05-26 1508
9 TUD Crossing tracking The TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with significant va... tracking detection segmentation multitarget pedestrian sideview overlap urban link 2015-06-19 1844


total views: 98854 5 queries in 0.0001368522644043s 0.00011897087097168s 0.00019502639770508s 2.1934509277344E-5s 0.0012180805206299s and total 0.0077130794525146s