Carnegie Mellon Robotics Institute
Howard Wactlar, Alex Hauptmann, and Michael Witbrock
Proceedings of ARPA Speech Recognition Workshop, February, 1996.
| Abstract |
| In theory, speech recognition technology can make any spoken words in video or audio media usable for text indexing, search and retrieval. This article describes the News-on-Demand application created within the Informedia™ Digital Video Library project and discusses how speech recognition is used in transcript creation from video, alignment with closed-captioned transcripts, audio paragraph segmentation and a spoken query interface. Speech recognition accuracy varies dramatically depending on the quality and type of data used. Informal information retrieval tests show that reasonable recall and precision can be obtained with only moderate speech recognition accuracy. |
| Notes |
Associated Center(s) / Consortia: Vision and Autonomous Systems Center
Associated Project(s): Informedia Digital Video Library |
| Text Reference |
| Howard Wactlar, Alex Hauptmann, and Michael Witbrock, "Informedia: News-on-Demand Experiments in Speech Recognition," Proceedings of ARPA Speech Recognition Workshop, February, 1996. |
| BibTeX Reference |
|
@inproceedings{Wactlar_1996_3345,
  author = "Howard Wactlar and Alex Hauptmann and Michael Witbrock",
  title = "Informedia: News-on-Demand Experiments in Speech Recognition",
  booktitle = "Proceedings of ARPA Speech Recognition Workshop",
  publisher = "Arden House",
  address = "Harriman, NY",
  month = "February",
  year = "1996",
} |
| The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University. |