The Robotics Institute
Video Skimming and Characterization through the Combination of Image and Language Understanding
M. Smith and T. Kanade
Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Databases, January 1998, pp. 61-70.

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

Digital video is rapidly becoming important for education, entertainment, and a host of multimedia applications. With video collections growing to thousands of hours, technology is needed to browse segments effectively in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a skim video that represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming through the extraction of significant information, such as specific objects, audio keywords, and relevant video structure. The resulting skim video is much shorter, with compaction as high as 20:1, yet it retains the essential content of the original segment. We have conducted a user study to test the content summarization and the effectiveness of the skim as a browsing tool.

Notes

Associated center: VASC
Associated project: Informedia Digital Video Library

Text Reference

M. Smith and T. Kanade, "Video Skimming and Characterization through the Combination of Image and Language Understanding," Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Databases, January 1998, pp. 61-70.

BibTeX Reference

@inproceedings{Smith_1998_2724,
   author = "Michael Smith and Takeo Kanade",
   title = "Video Skimming and Characterization through the Combination of Image and Language Understanding",
   booktitle = "Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Databases",
   month = "January",
   year = "1998",
   pages = "61--70"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.