|
|
|
|
RI | Publications | Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library
|
|
Text only version of this site
Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library
H. Wactlar, A. Hauptmann, and M.J. Witbrock
tech. report CMU-CS-98-109, Computer Science Department, Carnegie Mellon University, March, 1998.
Jump to: Download | Abstract | Notes | Text Reference | BibTeX Reference
| Download [Help] |
Adobe portable document format (pdf) [46 KB]
Compressed postscript (ps.gz) [85 KB]
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.
| Abstract |
In theory, speech recognition technology can make any spoken words in video or audio media usable for text indexing, search and retrieval. This article describes the News-on-Demand application created within the InformediaTM Digital Video Library project and discusses how speech recognition is used in transcript creation from video, alignment with closed-captioned transcripts, audio paragraph segmentation and a spoken query interface. Speech recognition accuracy varies dramatically depending on the quality and type of data used. Informal information retrieval tests show that reasonable recall and precision can be obtained with only moderate speech recognition accuracy.
| Notes |
Associated center: VASC
Associated project: Informedia Digital Video Library
| Text Reference |
H. Wactlar, A. Hauptmann, and M.J. Witbrock, Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library, tech. report CMU-CS-98-109, Computer Science Department, Carnegie Mellon University, March, 1998.
| BibTeX Reference |
@techreport{Wactlar_1998_3732,
author = "Howard Wactlar and Alex Hauptmann and M.J. Witbrock",
title = "Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library",
institution = "Computer Science Department, Carnegie Mellon University",
month = "March",
year = "1998",
number = "CMU-CS-98-109",
address = "Pittsburgh, PA"
}