Video Skimming for Quick Browsing based on Audio and Image Characterization

Michael Smith and Takeo Kanade
tech. report CMU-CS-95-186, Computer Science Department, Carnegie Mellon University, July, 1995


Download
  • Adobe portable document format (pdf) (2MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
Digital video is rapidly becoming an important source for information, entertainment and a host of multimedia applications. With the size of these collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a "skim" video which represents a short synopsis of the original. The extraction of significant information, such as specific objects, audio keywords and relevant video structure, is made possible through the integration of techniques in image and language understanding. The resulting skim is much smaller, and retains the essential content of the original segment.

Notes
Associated Center(s) / Consortia: Vision and Autonomous Systems Center
Associated Project(s): Informedia Digital Video Library

Text Reference
Michael Smith and Takeo Kanade, "Video Skimming for Quick Browsing based on Audio and Image Characterization," tech. report CMU-CS-95-186, Computer Science Department, Carnegie Mellon University, July, 1995

BibTeX Reference
@techreport{Smith_1995_2545,
   author = "Michael Smith and Takeo Kanade",
   title = "Video Skimming for Quick Browsing based on Audio and Image Characterization",
   booktitle = "",
   institution = "Computer Science Department",
   month = "July",
   year = "1995",
   number= "CMU-CS-95-186",
   address= "Pittsburgh, PA",
}