Exemplar Driven Character Recognition in the Wild

Karthik Sheshadri and Santosh Kumar Divvala
British Machine Vision Conference (BMVC) 2012, September, 2012.


Download
  • Adobe portable document format (pdf) (1MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
Character recognition in natural scenes continues to represent a formidable challenge in computer vision. Beyond variation in font, there exist difficulties in occlusion, background clutter, binarisation, and arbitrary skew. Recent advances have leveraged state of the art methods from generic object recognition to address some of these challenges. In this paper, we extend the focus to Indic script languages (e.g., Kannada) that contain large character sets (order of 1000 classes unlike 62 in English) with very low inter character variation. We identify this scenario as a fine grained visual categorization task, and present a simple exemplar based multi layered classification approach to the problem. The proposed approach beats the existing state of the art on the chars74k and ICDAR datasets by over 10% for English, and around 24% for Kannada.

Notes

Text Reference
Karthik Sheshadri and Santosh Kumar Divvala, "Exemplar Driven Character Recognition in the Wild," British Machine Vision Conference (BMVC) 2012, September, 2012.

BibTeX Reference
@inproceedings{Sheshadri_2012_7257,
   author = "Karthik Sheshadri and Santosh Kumar Divvala",
   title = "Exemplar Driven Character Recognition in the Wild",
   booktitle = "British Machine Vision Conference (BMVC) 2012",
   month = "September",
   year = "2012",
}