The Robotics Institute
Search the site
RI | Publications | Text Detection and Translation from Natural Scenes

Text only version of this site

Text Detection and Translation from Natural Scenes
J. Gao, J. Yang, Y. Zhang, and unknown
tech. report CMU-CS-01-139, Computer Science Department, Carnegie Mellon University, June, 2001.

Jump to: Download | Abstract | Text Reference | BibTeX Reference

Download [Help]

Adobe portable document format (pdf) [2583 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

We present a system for automatic extraction and interpretation of signs from a natural scene. The system is capable of capturing images, detecting and recognizing signs, and translating them into a target language. The translation can be displayed on a hand-held wearable display, or a head mounted display. It can also be synthesized as a voice output message over the earphones. We address challenges in automatic sign extraction and translation. We describe methods for automatic sign extraction. We extend example-based machine translation technology for sign translation. We use a user-centered approach in the system development. The approach takes advantage of human intelligence if needed and leverages human capabilities. We are currently working on Chinese sign translation. We have developed a prototype system that can recognize Chinese signs input from a video camera that is a common gadget for a tourist, and translate the signs either into English text or a voice stream. We have built up a database containing about 800 Chinese signs for development and evaluation. We present evaluation results and analyze errors. The sign translation, in conjunction with spoken language translation, can help international tourists to overcome language barriers. The technology can also help a visually handicapped person to increase environmental awareness.

Text Reference

J. Gao, J. Yang, Y. Zhang, and unknown, Text Detection and Translation from Natural Scenes, tech. report CMU-CS-01-139, Computer Science Department, Carnegie Mellon University, June, 2001.

BibTeX Reference

@techreport{Gao_2001_3966,
   author = "Jiang Gao and Jie Yang and Ying Zhang and unknown",
   title = "Text Detection and Translation from Natural Scenes",
   institution = "Computer Science Department, Carnegie Mellon University",
   month = "June",
   year = "2001",
   number = "CMU-CS-01-139",
   address = "Pittsburgh, PA"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
For updates and comments, please see these instructions.
This page maintained by robotwebmaster@ri.cmu.edu