RI Homepage Carnegie Mellon Homepage RI Homepage

The Robotics Institute

Carnegie Mellon Robotics Institute

Efficient Temporal Consistency for Streaming Video Scene Analysis

Ondrej Miksik, Daniel Munoz, J. Andrew (Drew) Bagnell, and Martial Hebert
tech. report CMU-RI-TR-12-30, Robotics Institute, Carnegie Mellon University, September, 2012


Download
  • Adobe portable document format (pdf) (1MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
We address the problem of image-based scene analysis from streaming video, as would be seen from a moving platform, in order to efficiently generate spatially and temporally consistent predictions of semantic categories over time. In contrast to previous techniques which typically address this problem in batch and/or through graphical models, we demonstrate that by learning visual similarities between pixels across frames, a simple filtering algorithm is able to achieve high performance predictions in an efficient and online/causal manner. Our technique is a meta-algorithm that can be efficiently wrapped around any scene analysis technique that produces a per-pixel semantic label distribution. We validate our approach over three different scene analysis techniques on three different datasets that contain different semantic object categories. Our experiments demonstrate our approach is very efficient in practice and substantially improves the quality of predictions over time.

Notes
Associated Center(s) / Consortia: Vision and Autonomous Systems Center
Number of pages: 7

Text Reference
Ondrej Miksik, Daniel Munoz, J. Andrew (Drew) Bagnell, and Martial Hebert, "Efficient Temporal Consistency for Streaming Video Scene Analysis," tech. report CMU-RI-TR-12-30, Robotics Institute, Carnegie Mellon University, September, 2012

BibTeX Reference
@techreport{Miksik_2012_7294,
   author = "Ondrej Miksik and Daniel Munoz and J. Andrew (Drew) Bagnell and Martial Hebert",
   title = "Efficient Temporal Consistency for Streaming Video Scene Analysis",
   booktitle = "",
   institution = "Robotics Institute",
   month = "September",
   year = "2012",
   number= "CMU-RI-TR-12-30",
   address= "Pittsburgh, PA",
}