Neural Network Classifiers for Optical Chinese Character Recognition

Richard Romero, Robert W. Berger, Robert H. Thibadeau, and David S. Touretzky
1995.


Download
  • Adobe portable document format (pdf) (98KB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
We describe a new, publicly accessible Chinese character recognition system based on a nearest neighbor classifier that uses several sophisticated techniques to improve its performance. To increase throughput, a 400-dimensional feature space is compressed through multiple discriminant analysis techniques to 100 dimensions. Recognition accuracy is improved by scaling these dimensions to achieve uniform variance. Two neural network classifiers are compared using the new feature space, Kohonen's Learning Vector Quantization and Geva and Sitte's Decision Surface Mapping. Experiments with a 37,000 character ground truthed dataset show performance comparable to other systems in the literature. We are now employing noise and distortion models to quantify the robustness of the recognizer on realistic page images.

Notes
Associated Lab(s) / Group(s): Internet Systems Lab
Associated Project(s): Printed Chinese Character Recognition

Text Reference
Richard Romero, Robert W. Berger, Robert H. Thibadeau, and David S. Touretzky, "Neural Network Classifiers for Optical Chinese Character Recognition," 1995.

BibTeX Reference
@incollection{Romero_1995_2961,
   author = "Richard Romero and Robert W Berger and Robert H. Thibadeau and David S Touretzky",
   title = "Neural Network Classifiers for Optical Chinese Character Recognition",
   booktitle = "",
   year = "1995",
}