Visual Utility – A Framework for Focusing Computer Vision Algorithms

Mark Desnoyer
PhD Thesis, Tech. Report, CMU-RI-TR-15-32, Robotics Institute, Carnegie Mellon University, September, 2015

View Publication

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


The real world is a rich environment, fraught with complexity. To be robust in this complex environment, computer Vision algorithms that operate in Unstructured Environments (VUE) tend to use large amounts of data or complex modeling. Unfortunately, these algorithms also require signicant computational resources. In this thesis, we examine a visual utility framework that we show is used regularly in an ad hoc manner. This framework uses visual utility estimators to speed up VUE algorithms with minimal performance degradation by focusing those higher level algorithms on the most relevant imagery. We formally dene this framework, show that it has a submodular structure and discover under what conditions using it is valuable in practice. We nd that visual utility approaches are most eective when using fast, task specic visual utility estimators and the VUE task is computationally expensive, We also introduce SCATAT, a cascade building algorithm. SCATAT takes advantage of the submodular structure of the visual utility framework in order to build a near optimal cascade that trades o task performance and processing requirements explicitly. We then validate this algorithm in an experimental case study on object detection. Finally, in theoretical case studies, we prove that two seminal cascade algorithms are special cases of our visual utility framework. We show they optimize a submodular visual utility function, explaining their high observed performance in practice.

author = {Mark Desnoyer},
title = {Visual Utility – A Framework for Focusing Computer Vision Algorithms},
year = {2015},
month = {September},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-15-32},
} 2017-09-13T10:38:34-04:00