[go: nahoru, domu]

Skip to content

tangzhenyu/Scene-Text-Understanding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 

Repository files navigation

Scene-Text-Understanding

Survey

  • [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper
  • [2014-Front.Comput.Sci] Scene Text Detection and Recognition: Recent Advances and Future Trends paper

Scene Text Understanding

Scene Text Recognition

Dataset

COCO-Text (ComputerVision Group, Cornell) 2016

  • 63,686images, 173,589 text instances, 3 fine-grained text attributes.
  • Task:text location and recognition

COCO-Text API

Synthetic Data for Text Localisation in Natural Image (VGG)2016

  • 800k thousand images
  • 8 million synthetic word instances
  • download

Synthetic Word Dataset (Oxford, VGG) 2014

  • 9million images covering 90k English words
  • Task:text recognition, segmentation
  • download

IIIT 5K-Words 2012

  • 5000images from Scene Texts and born-digital (2k training and 3k testing images)
  • Eachimage is a cropped word image of scene text with case-insensitive labels
  • Task:text recognition
  • download

StanfordSynth(Stanford, AI Group) 2012

  • Small single-character images of 62 characters (0-9, a-z, A-Z)
  • Task:text recognition
  • download

MSRA Text Detection 500 Database(MSRA-TD500) 2012

  • 500 natural images(resolutions of the images vary from 1296x864 to 1920x1280)
  • Chinese,English or mixture of both
  • Task:text detection

Street View Text (SVT) 2010

  • 350 high resolution images (average size 1260 × 860) (100 images for training and 250 images for testing)
  • Only word level bounding boxes are provided with case-insensitive labels
  • Task:text location

KAIST Scene_Text Database 2010

  • 3000 images of indoor and outdoor scenes containing text
  • Korean,English (Number), and Mixed (Korean + English + Number)
  • Task:text location, segmentation and recognition

Chars74k 2009

  • Over 74K images from natural images, as well as a set of synthetically generatedcharacters

  • Smallsingle-character images of 62 characters (0-9, a-z, A-Z)

  • Task:text recognition

  • ICDAR Benchmark Datasets

Dataset Discription Competition Paper
ICDAR 2015 1000 training images and 500 testing images paper link
ICDAR 2013 229 training images and 233 testing images paper link
ICDAR 2011 229 training images and 255 testing images paper link
ICDAR 2005 1001 training images and 489 testing images paper link
ICDAR 2003 181 training images and 251 testing images(word level and character level) paper link

Blogs

Open Resources Code

Hand Writing Recognition

Licence Tag Recognition

About

OCR, Scene-Text-Understanding, Text Recognition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published