Carnegie Mellon Robotics Institute
Michael Smith and Takeo Kanade
tech. report CMU-CS-97-111, Computer Science Department, Carnegie Mellon University, February, 1997
| Abstract |
| Digital video is rapidly becoming important for education, entertainment, and a host of multimedia applications. With video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a "skim" video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter, with compaction as high as 20:1, yet retains the essential content of the original segment. |
| Notes |
Associated Center(s) / Consortia: Vision and Autonomous Systems Center
Associated Project(s): Informedia Digital Video Library |
| Text Reference |
| Michael Smith and Takeo Kanade, "Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques," tech. report CMU-CS-97-111, Computer Science Department, Carnegie Mellon University, February, 1997 |
| BibTeX Reference |
|
@techreport{Smith_1997_3197,
  author = "Michael Smith and Takeo Kanade",
  title = "Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques",
  booktitle = "",
  institution = "Computer Science Department",
  month = "February",
  year = "1997",
  number = "CMU-CS-97-111",
  address = "Pittsburgh, PA",
} |
| The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University. |