Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset

2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   activity   address   adhead   adjustment   aerial   aesthetic   aesthetics   age   aic   aircraft   airplane   airport   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulated   aspect   attention   attribute   attributes   authentication   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   calibration   caltech   camera   canada   captioning   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenges;   change   chest   chromaticity   church   circle   cities   city   classification   clustering   clutter   cnn   co-segmentation   coco   code   codebook   coffee   color   community   comparison   conditions   constancy   context   contour   copyright   cosegmentation   counting   cover   cow   cross-view   crowd   ct   cutting   database;   dataset   dataset;   day   decomposition   deep   deformation   dense   depth   description   descriptor   detail   detection   detection;   dichromatic   disgust   disparity   dogs   domain   driving   dubrovnik   duplicate   dynamic   ear   ecocentric   edge   egocentric   ellipses   emotion   estimation   evaluation   event   expression   eye   facade   face   facial   fear   feature   field   fine-grained   fingerprints   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   foreground   foreground;   fov   frames   frontview   fundus   gait   game   genetic   genome   geography   geometry   geotag   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   ground-truth;   groundtruth   group   hand   hands   handwritten   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   human   identification   illumination   image   imagenet   images   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   keyframe   kimia   kinect   label   labeling   laboratory   landmark   lane   language   large   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   mask   match   matching   material   medial   medical   memorability   mesh   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motion-capture-data   motorbike   mouse   movement   movie   movies   moving   mpeg   mug   multi-camera;   multi-class   multi-mode   multi-sensor;   multi-view   multilabel   multiple   multitarget   multiview   naming   natural   nature   navigation   network   neutral   newyork   night   noise   nude   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   open-view   optical   optimization   organ   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   panorama   panoramio   paris   parsing   part   partial   pasadena   pascal   patch   path   pedestrian   people   person   perspective   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   popularity   pornography   pose   presentation   pressure   primitive   procedural   profile   proposal   ptz   quality   radar   randomnoise   rank   ranking   ransac   rate   ratio   re-identification   real   recognition   recognition;   reconstruction   rectification   rectified   reflection   registration   regular   remote   removal   repetition   retina   retinal   retrieval   rgb   rgbd   road   robot   robust   rome   room   rotation   sad   saliency   sampling   sanfrancisco   scale   scan   scanner   scene   scenes   search   segmentation   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   shapes   sheffield   shoes   shots   shutter   sideview   sign   similarity   single   singletarget   singleview   skeleton   sketch   skin   sky   slam   soccer   social   software   source   spain   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   streetside   streetview   structure   structure-from-motion   structures   study   stuff   stylization   subpixel   subtraction;   summarization   summary   superresolution   supervised   surface   surprise   surveillance   swan   switzerland   symmetry   synthetic   target   taxonomy   temporal   text   texture   therapy   thermal   things   time   time-series   tiny   tool   tools   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   video   video2gif   videos   videosurveillance   view   viewpoint   vision   visual   volleyball   vt   water   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   world   xray   year   zoom   zurich  
«showing 524 tags of 524 total tags for 372 datasets (1.41) »

DID Name Description Tags URL Date Views
369 Nude Detection Dataset — Images (NPDI/DCC/UFMG) The database contains 180 images collected from the Web. If you make use of our database, please cite the following reference: LOPES, Ana; AVILA, Sandra... nude detection, images link 2017-03-31 60
368 Nude Detection Dataset — Videos (NPDI/DCC/UFMG) The database of nude and non-nude videos contains a collection of 179 video segments collected from the following movies: Alpha Dog, Basic Instinct, Before The ... nude detection, videos, movies link 2017-03-31 41
364 ETH CVL IMDB WIKI Faces Since the publicly available face image datasets are often of small to medium size, rarely exceeding tens of thousands of images, and often without age informat... face imdb wikipedia detection recognition age biometry link 2017-02-22 139
357 udacity self-driving-car At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing and usef... car robot driving autonomous street urban video recognition detection classification segmentation time synthetic link 2017-03-15 256
356 The Oxford RobotCar Dataset The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset captures ... car robot driving autonomous street urban video recognition detection classification segmentation time year link 2017-01-04 208
348 Global Symmetry AVA Dataset Global Symmetry Ground-truth for AVA dataset Release Date: 2016 For detailed information, please refer to: Elawady, Mohamed, Cécile Barat, Christophe Duc... Global Bilateral Symmetry Detection Aesthetic Reflection Mirror link 2016-11-02 144
347 MOCAT (TUB Multi-Object and Multi-Camera Tracking Dataset) The TU Berlin Multi-Object and Multi-Camera Tracking Dataset (MOCAT) is a synthetic dataset to train and test tracking and detection systems in a virtual world.... synthetic tracking detection multi-class multi-view evaluation pedestrian vehicle animal link 2016-11-02 257
330 Cityscapes We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high quality... stereo video urban cities semantic segmentation detection car person pedestrian weakly link 2016-07-19 733
329 Virginia Tech and Arab Academy for Science & Technology (VT-AAST) The VT-AAST Benchmarking Dataset A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques​. A new color face image database for ... face, detection, skin, segmentation, benchmarking, link 2016-07-11 324
327 PIROPO Database: People in Indoor ROoms with Perspective and Omnidirectional cameras The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor rooms, u... people surveillance perspective omnidirectional fisheye indoor room detection human link 2017-02-16 449
326 Desk3D (Cambridge University) Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., & Cipo... depth instance pose detection link 2016-04-15 355
322 Kendall Square Webcam The Kendall Square webcam dataset consists of two streams for one sunny day and one cloudy day of a city square. It is used for tracking and analyzing color cha... webcam color weather change detection appearance sky link 2016-03-02 431
317 NYU Symmetry Database The mirror symmetry database contains 176 single-symmetry and 63 multyple-symmetry images (.png files) with accompanying ground-truth annotations (.mat files). ... symmetry detection mirror groundtruth link 2016-04-15 296
314 WIDER FACE: A Face Detection Benchmark WIDER FACE dataset is a large-scale face detection benchmark dataset with 32,203 images and 393,703 face annotations, which have high degree of variabilities in... face detection scale pose occlusion link 2016-02-11 640
309 Coutour patches The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im... patch image match contour edge lowlevel detection segmentation link 2015-09-29 357
307 HandNet annotated hand dataset The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a... hands articulated segmentation classification detection pose fingertip link 2015-09-07 586
302 CMP map2photo The CMP map2photo dataset consists of 6 pairs, where one image is satellite photo and second image is a map of the same area. The task is to match these images... feature detection description matching map remote sensing wide baseline link 2015-08-13 426
301 CMP Extreme Zoom Dataset The Extreme Zoom Dataset. EZD is a 6 image sets with incleasing zoom factor from general scene view to focusing on single detail. MODS: Fast and Robust Metho... feature detection description matching viewpoint zoom link 2015-07-15 367
300 CMP WxBS dataset The Wide (multiple) Baseline Dataset. 31 image pairs, simultaneously combining several nuisance factors: geometry, illumination, IR-visible, etc. WxBS: Wide ... feature detection description matching viewpoint IR day night link 2015-07-15 539
288 Berkeley Urban Street tracking The UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca... tracking detection segmentation multitarget recognition video pedestrian urban human link 2015-07-14 893
287 INRIA Lafarge Benchmarks Some datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting Line-ne... 3d surface reconstruction groundtruth pointcloud object detection line road network urban crowd pedestrian counting link 2015-06-18 665
286 HDA Person Dataset - ISR Lisbon The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De... Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human link 2017-01-26 1224
284 TRANCOS Overlapping Car Crowds The TRaffic ANd COngestionS (TRANCOS) dataset, a novel benchmark for (extremely overlapping) vehicle counting in traffic congestion situations. It consists of 1... object detection car transportation vehicle highway urban spain traffic link 2015-06-16 755
280 Yahoo Flickr Creative Commons 100M Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ... flickr landmark image recognition detection reconstruction 3d clustering social community internet link 2015-09-24 660
279 WWW Crowd The Where Who Why (WWW) dataset provides 10,000 videos with over 8 million frames from 8,257 diverse scenes, therefore offering a superior comprehensive dataset... surveillance crowd pedestrian detection recognition flow optical video link 2015-05-27 765
275 TST fall detection It is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, with diff... action recognition detection depth kinect wearable accelerometer human video link 2017-03-14 739
273 SBMI 2015 Scene Background Initialization (SBI) dataset The SBI dataset has been assembled in order to evaluate and compare the results of background initialization al... change detection background initialization foreground benchmark link 2015-05-02 415
272 Stanford 40 Actions The Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performing the ac... human action recognition detection boundingbox link 2015-06-19 674
263 Crowd Dataset The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense crowd in t... crowd video detection anomaly scene understanding human pedestrian link 2016-11-05 1112
262 PHOS (Evaluating illumination invariance) Phos is a color image database of 15 scenes captured under different illumination conditions. Every scene of the database contains 15 different images: 9 images... Illumination invariance, real lighting conditions, uneven illumination, shadows, feature detection link 2017-03-20 518
257 FaceScrub The FaceScrub dataset comprises a total of 107818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per person. M... face detection recognition celebrity people human link 2014-11-24 682
256 Multi-Task Facial Landmark (MTFL) dataset This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ... face, landmark detection, deep learning, cnn, attribute link 2015-11-07 1096
254 ChokePoint Dataset We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e... human pedestrian identification recognition multiview sequence face detection real world surveillance clustering link 2015-05-02 922
253 Street View House Number (SVHN) SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatti... streetview number recognition classification urban streetside detection text real world link 2016-08-24 815
252 Volleyball Activity Dataset 2014 This dataset contains 7 challenging volleyball activity classes annotated in 6 videos from professionals in the Austrian Volley League (season 2011/12). A total... action activity sport volleyball detection recognition video analysis link 2014-10-23 1135
248 VIDEO datasets overview Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy spread... video benchmark recognition classification detection object action link 2014-09-30 939
247 PASCAL VOC Parts The PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask for 2... detection recognition pascal object part pedestrian human segmentation semantic link 2014-09-30 994
242 Stanford Dogs Dataset The Stanford Dogs dataset contains images of 120 breeds of dogs from around the world. This dataset has been built using images and annotation from ImageNet for... classification, detection, fine-grained categorization, dogs link 2015-07-29 1057
240 Microsoft COCO The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features: Mo... object context segmentation detection recognition benchmark semantic link 2015-05-02 1183
239 CUHK crowd dataset CUHK crowd dataset introduces the largest publicly available crowd dataset of 474 videos from 215 crowded scenes. It has been used in the paper: Scene-Ind... crowd analysis, group detection and analysis, scene understanding link 2016-09-14 1028
232 Pratheepan Human Skin Detection Dataset The images in this dataset are downloaded randomly from Google for human skin detection research. It has been used in the paper: W.R. Tan, C.S. Chan, Y. Prathee... skin detection, skin segmentation, human detection, skin dataset link 2016-11-05 1643
227 Omnidirectional and panoramic image dataset We share our omnidirectional and panoramic image dataset (with annotations) to be used for human and car detection. Please reach through: panorama detection car omnidirection human recognition link 2017-01-13 1000
225 California-ND An Annotated Dataset For Near-Duplicate Detection In Personal Photo Collections Managing photo collections involves a variety of image quality assessment tas... retrieval duplicate copyright groundtruth detection link 2014-03-19 581
224 CMP Extreme View Dataset 15 wide baseline stereo image pairs with large viewpoint change, provided ground truth homographies. Image size (~1000x700 pixels, RGB) D. Mishkin and M. ... feature detection description matching viewpoint link 2015-07-15 744
222 Ford Car Dataset The Ford Car dataset is joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects). ... car detection lidar 3d groundtruth sfm link 2014-04-16 1463
221 EPFL Multi-View Cars Th EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using the time o... pose multiview car detection estimation rotation link 2014-02-10 854
217 Youtube-Objects dataset The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for... video object detection segmentation flow optical link 2014-02-03 810
216 CVC Partial Occlusion Virtual Pedestrian The CVC Partial Occlusion Virtual Pedestrian datasets (CVC-01 to CVC-06) cover a range of scenarios of occluded pedestrians generated in a virtual and real envi... detection classification tracking pedestrian synthetic urban occlusion link 2016-03-15 1036
213 ChairGest Gestures ChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 Xsens Ine... benchmark recognition kinect gesture detection human link 2014-06-06 528
210 Traffic Video dataset The Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be downloaded ... urban traffic tracking detection overhead view road video link 2014-02-03 2231
201 50 Salads The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities correspo... action activity recognition classification detection tracking video link 2013-10-05 722
199 THUR15000 We introduce a labeled dataset of categorized images for evaluating sketch based image retrieval. Using Flickr, we downloaded about 3000 images for each of the ... group saliency object detection visual attention sketch shape retrieval internet link 2013-10-08 750
198 THUS10000 The THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotated with p... segmentation saliency object detection visual attention link 2015-01-11 880
193 City planar and non-planar The city planar and non-planar datset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (University)... plane detection 3d urban building estimation link 2013-09-23 542
190 Daimler Mono Pedestrian Detection Benchmark The Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (image cut-o... pedestrian detection outdoor urban mono scale object link 2013-09-18 773
188 KTH Multiview Football The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ... multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget link 2016-09-18 1066
187 Aspect Layout dataset The Aspect Layout dataset is designed to allow evaluation of object detection for aspect ratios in perspective images. Author text: In this project we see... detection object aspect ratio perspective layout link 2013-09-06 490
182 MSR Action The MSR Action datasets is a collection of various 3D datasets for action recognition. See details video action recognition detection reconstruction 3d link 2013-09-05 730
169 QMUL Junction Dataset The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra... detection tracking crowd counting pedestrian video motion behavior link 2016-12-06 1070
168 Mall Dataset The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians were label... detection tracking crowd counting pedestrian indoor video webcam link 2016-12-06 1281
166 ICG Multi-Camera Datasets The ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowded sc... multiview pedestrian tracking detection object camera calibration graz indoor video multitarget link 2015-06-19 974
165 ICG Multi-Camera and Virtual PTZ The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph... multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget link 2015-06-19 1016
164 ICG Lab 6 (Multi-Camera Multi-Object Tracking) The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came... multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz link 2013-10-08 1304
163 TUGRAZ ICG Longterm Pedestrian Dataset The Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It used for adaptive detection and back... pedestrian change detection background illumination robust indoor coffee graz multitarget link 2015-06-19 799
161 ICG Annotated Facial Landmarks in the Wild (AFLW) The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet... face detection landmark pose age annotation link 2016-12-13 1463
160 Caltech Lanes Dataset The Caltech Lanes dataset includes four clips taken around streets in Pasadena, CA at different times of day. The archive below includes 1225 individual frame... urban road lane detection caltech pasadena link 2013-08-08 796
157 Background Models Challenge (BMC) Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -... background modeling change motion detection surveillance video segmentation link 2016-02-24 1149
151 People in WBCN This dataset is for people tracking in wide baseline camera networks and was designed as a contest at ICPR 2012. The contest consists of two challenges: ... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-08-02 1053
150 SDHA Contest The Semantic Description of Human Activities (SDHA) was a contest at ICPR 2010. The contest is composed of three different types of activity recognition cha... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-07-31 761
147 FlickrLogos-32 The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods... flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox link 2017-01-11 938
142 German Traffic Sign Recognition Benchmark The German Traffic Sign Recognition Benchmark is a dataset for multi-class detection problem in natural images and do cordially invite you to participate. The b... detection, traffic, urban, recognition link 2016-08-15 1209
138 Buffy The Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for training... segmentation, detection, buffy, movie, human link 2015-02-07 579
113 Penn-Fudan Pedestrian Penn-Fudan Pedestrian Detection and Segmentation... pedestrian detection segmentation background motion link 2013-08-08 696
107 BIWI Pedestrians We provide the three datasets used for testing our system for our ICCV 2007 publication, including annotations. Data was recorded using a pair of AVT Marlins mo... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion link 2013-03-12 910
106 BIWI Walking Pedestrians (EWAP) The BIWI Walking Pedestrians (EWAP) dataset shows walking pedestrians in busy scenarios from a bird eye view. Manually annotated. Data used for training in our ... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-08-02 1295
99 BSDS500 This new dataset is an extension of the BSDS300, where the original 300 images are used for training / validation and 200 fresh images, together with human anno... segmentation, edge, contour, detection link 2013-03-12 708
98 BSDS300 The goal of this work is to provide an empirical basis for research on image segmentation and boundary detection. To this end, we have collected 12,000 hand-la... segmentation, edge, contour, detection link 2013-03-12 710
95 Stroke Width Transform Text Stroke Width Transform Text dataset is by Boris Epstein and consists of 307 images and XXX text instances. Detecting Text in Natural Scenes with Stroke Wid... text, detection, recognition, classification link 2015-04-24 827
94 Chars74K The Chars74K dataset consists of 64 classes (0-9, A-Z, a-z), 7705 characters obtained from natural images, 3410 hand drawn characters using a tablet PC, 62992 s... text, detection, recognition, classification link 2016-04-22 1093
93 Street View Text The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challenging becaus... text, detection, recognition, classification, outdoor, urban link 2014-01-13 835
92 ICDAR 2011 This challenge is set up around three tasks: Text Localisation, Text Segmentation and Word Recognition. Participation in any or all tasks is welcome. Check the ... text, detection, recognition, classification link 2016-06-01 633
91 ICDAR 2003 The ICDAR 2003 datasets available for download on this site: Robust Reading , Robust Word Recognition , Robust OCR , Text Locating and Cursive Script . Pleas... text, detection, recognition, classification link 2013-03-12 741
88 Change Detection The dataset folder contains 7 folders (one for each category). Each category folder contains 4 to 6 folders (one for each video). Each video folder contains: ... change, detection, background link 2013-03-13 584
86 ICG Graz240 The ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult due to ... segmentation, detection, semantic, urban, graz link 2016-03-29 729
79 LabelMe The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by visitin... segmentation, semantic, outdoor, detection, urban, software link 2013-03-14 631
78 Caltech Pedestrian The Caltech Pedestrian Dataset consists of approximately 10 hours of 640x480 30Hz video taken from a vehicle driving through regular traffic in an urban environ... pedestrian, detection, urban link 2016-06-07 1393
77 Daimler Stereo Pedestrian Daimler Stereo Pedestrian Detection Benchmark C. Keller, M. Enzweiler, and D. M. Gavrila, A New Benchmark for Stereo-based Pedestrian Detection, Proc. of th... pedestrian, detection, urban link 2013-03-13 629
76 Daimler Pedestrian Classification Daimler Multi-Cue, Occluded Pedestrian Classification Benchmark Training and test samples have a resolution of 48 x 96 pixels with a 12-pixel border around t... detection, classification, pedestrian, urban link 2013-03-11 663
75 ETHZ Shape The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-class ... shape, detection, matching, segmentation, clutter, applelogo, bottle, giraffe, nature, swan, mug link 2014-02-11 653
68 The KITTI Vision Benchmark Suite We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste... stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth link 2014-02-10 1048
62 Deformed Lattice Detection The Deformed Lattice Detection In Real-World Images dataset is used for regular grid detection. The authors have developed a robust and fast lattice detection a... texture, segmentation, symmetry, lattice, detection, urban link 2013-03-11 617
60 PSU HUB The PSU HUB dataset is a detection, tracking dataset. Ground truth trajectory and grouping information for pedestrians walking in the PSU student union building... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion link 2013-07-19 799
58 INRIA Horses The INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and cluttered ba... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 563
57 Weizmann Horses The multi-scale Weizmann horses (originally from Eran Borenstein, adapted by Jamie Shotton) consists of 656 images which is split into 50+50training, 50+50 vali... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 867
56 ETHZ Extended Shape The ETHZ Extended Shape classes dataset from Konrad Schindler is larger dataset of shape categories, created by merging ETHZ shape classes with Konrad Schindler... detection, shape, segmentation, clutter link 2013-03-11 567
53 DTU Robot The DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For each s... feature, detection, description, matching, sfm, reconstruction, illumination link 2016-05-15 671
52 Graffiti The Graffiti dataset by Krystian Mikolajczyk and Cordelia Schmid contains 48 images split into 8 sequences with 6 images each showing different structured and t... feature, detection, description, rectification, benchmark link 2017-02-23 642
29 The Yale Face The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to... face, pedestrian, detection, pose, illumination link 2015-06-23 611
28 CMU Faces - Frontal faces The MIT + CMU frontal face dataset from H. Rowley contains 130 images with 507 labeled frontal faces from movie, portrait and media sources. It is mostly graysc... frontview, face, detection object boundingbox link 2015-06-19 665
25 PASCAL VOCs The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late... detection segmentation pose pedestrian chair animal car building airplane link 2017-03-09 822
24 UIUC Cars This UIUC Cars dataset by Shivani Agarwal, Aatif Awan and Dan Roth contains images of side views of cars for use in evaluating object detection algorithms. The ... car, sideview, detection, scale, recognition, urban, scale link 2013-10-08 863
23 Graz02 The Graz02 dataset by Andreas Opelt and Axel Pinz contains four categories of images: bikes, people, cars and a single background class. The annotation has been... bike, pedestrian, background, detection, clutter, car, graz link 2014-04-24 865
22 Graz01 The Graz01 dataset by Andreas Opelt and Axel Pinz contains four types of images: bikes, people, background with no bikes, background with no people.... bike, pedestrian, background, detection, clutter, graz, occlusion link 2013-08-08 839
18 Leeds Cows The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of different backg... detection segmentation cow video background animal link 2013-08-08 703
17 TUD Motorbike The TUD Motorbike dataset from Bastian Leibe contains 115 images collected from the internet. Each image contains one or more motorbikes at different scales and... motorbike, detection, pascal link 2013-08-08 792
16 PETS 2009 The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for person coun... frontview, outdoor, pedestrian, detection, tracking, overlap, occlusion multitarget, human link 2015-06-19 1131
15 PETS 2006 The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train station scen... frontview, indoor, pedestrian, detection, tracking, multitarget link 2015-08-12 891
14 INRIA People The INRIA People dataset from Navneet Dalal and Bill Triggs [DalalCVPR2005] consists of training and testing data. The training contains 1805 images and X peopl... detection, pedestrian, sideview, frontview, human, boundingbox link 2015-06-19 1050
13 CBCL / MIT Pedestrian MIT Pedestrian dataset from Papageorgiou and Poggio [IJCV2000] contains 509 training and 200 test images of pedestrians in city scenes (plus left-right reflecti... pedestrian, frontview, detection, urban, people, boundingbox link 2015-06-19 773
9 TUD Crossing tracking The TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with significant va... tracking detection segmentation multitarget pedestrian sideview overlap urban link 2015-06-19 1430

total views: 87053 5 queries in 8.4877014160156E-5s 8.082389831543E-5s 0.00013899803161621s 0.00011587142944336s 0.0027158260345459s and total 0.0086040496826172s