The SPHINX speech recognition system

K.-F. Lee, H.-W. Hon, M.-Y. Hwang, S. Mahajan, and Raj Reddy
1989 International Conference on Acoustics, Speech, and Signal Processing (ICASSP '89), May 1989, pp. 445-448.


Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
This paper describes SPHINX, an accurate, large-vocabulary, speaker-independent, continuous speech recognition system. The authors have made several recent enhancements, including generalized triphone models, word duration modeling, function-phrase modeling, between-word coarticulation modeling, and corrective training. On the 997-word resource management task, SPHINX attained a word accuracy of 96% with a grammar (perplexity 60) and 82% without a grammar (perplexity 997).
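
The no-grammar perplexity quoted above equals the vocabulary size. The short Python sketch below illustrates why, under the assumption (not stated in the abstract) that "without grammar" means every one of the 997 words is treated as equally likely at each position. The function name `perplexity` and the variable names are illustrative only and do not come from the paper.

```python
import math


def perplexity(probs):
    """Return 2**H for a discrete distribution, where H is its entropy in bits."""
    entropy_bits = -sum(p * math.log2(p) for p in probs if p > 0)
    return 2 ** entropy_bits


# Vocabulary size of the resource management task, taken from the abstract.
VOCAB_SIZE = 997

# Assumption: with no grammar, each word is equally likely to come next.
uniform = [1.0 / VOCAB_SIZE] * VOCAB_SIZE
no_grammar_perplexity = perplexity(uniform)

# Word error rates implied by the reported word accuracies (100% minus accuracy).
error_with_grammar = 100 - 96
error_without_grammar = 100 - 82

print(round(no_grammar_perplexity, 3), error_with_grammar, error_without_grammar)
```

Under that uniform assumption the computed perplexity matches the vocabulary size up to floating-point rounding, which is consistent with the figure of 997 in the abstract. A grammar lowers perplexity by restricting which words may follow one another, so the recognizer chooses among fewer candidates at each step.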

Text Reference
K.-F. Lee, H.-W. Hon, M.-Y. Hwang, S. Mahajan, and Raj Reddy, "The SPHINX speech recognition system," 1989 International Conference on Acoustics, Speech, and Signal Processing (ICASSP '89), May 1989, pp. 445-448.

BibTeX Reference
@inproceedings{Reddy_1989_3708,
   author = "K.-F. Lee and H.-W. Hon and M.-Y. Hwang and S. Mahajan and Raj Reddy",
   title = "The SPHINX speech recognition system",
   booktitle = "1989 International Conference on Acoustics, Speech, and Signal Processing (ICASSP '89)",
   pages = "445--448",
   month = "May",
   year = "1989",
   volume = "1",
}