This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description of each dataset covering common problems, pitfalls and characteristics, and now a searchable tag cloud.
Plus, the list is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
«showing 524 tags of 524 total tags for 372 datasets (1.41) »
|344||YACCLAB dataset||The YACCLAB dataset includes both synthetic and real binary images and is suitable for a wide range of applications, ranging from document processing to survail...||Labeling Binary Text Medical Fingerprints VideoSurveillance Natural RandomNoise||link||2017-01-20||190|
|253||Street View House Number (SVHN)||SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatti...||streetview number recognition classification urban streetside detection text real world||link||2016-08-24||812|
|167||Text and Vision (TVGraz) Dataset||The Text and Vision (TVGraz) dataset is an annotated multi-modal dataset which currently contains 10 visual object categories, 4030 images and associated text. ...||text appearance classification evaluation||link||2017-01-10||818|
|144||MNIST handwritten digits||The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of ...||text, classification, letter||link||2017-02-06||1407|
|96||USPS Handwritten Digits||USPS: 10 classes, 7291 training examples, 2007 test examples, 256 features; 8-bit grayscale images of "0" through "9"; handwritten digits; ...||text, recognition, classification, handwritten||link||2013-03-12||766|
|95||Stroke Width Transform Text||The Stroke Width Transform Text dataset by Boris Epstein consists of 307 images and XXX text instances. Detecting Text in Natural Scenes with Stroke Wid...||text, detection, recognition, classification||link||2015-04-24||826|
|94||Chars74K||The Chars74K dataset consists of 64 classes (0-9, A-Z, a-z), 7705 characters obtained from natural images, 3410 hand drawn characters using a tablet PC, 62992 s...||text, detection, recognition, classification||link||2016-04-22||1092|
|93||Street View Text||The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challenging becaus...||text, detection, recognition, classification, outdoor, urban||link||2014-01-13||832|
|92||ICDAR 2011||This challenge is set up around three tasks: Text Localisation, Text Segmentation and Word Recognition. Participation in any or all tasks is welcome. Check the ...||text, detection, recognition, classification||link||2016-06-01||631|
|91||ICDAR 2003||The ICDAR 2003 datasets available for download on this site: Robust Reading, Robust Word Recognition, Robust OCR, Text Locating and Cursive Script. Pleas...||text, detection, recognition, classification||link||2013-03-12||741|
|27||Idiap/ETHZ Faces and Poses||The Idiap/ETHZ Faces and Poses dataset by L. Jie, B. Caputo and V. Ferrari contains 1703 image-caption pairs. Captions contain the names of some of...||face, pose, pedestrian, text||link||2013-03-11||649|