A Probabilistic Approach for the Adaptive Integration of Multiple Visual Cues Using an Agent Framework
A. Soto
doctoral dissertation, tech. report CMU-RI-TR-02-30, Robotics Institute, Carnegie Mellon University, October, 2002.

Download

Adobe portable document format (pdf) [3197 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

Most current machine vision systems for dynamic state estimation lack the flexibility to account for the high variability of unstructured environments. As the state of the world evolves, the potential knowledge provided by different visual attributes can change, breaking the initial assumptions of a non-adaptive vision system. This thesis develops a new comprehensive computational framework for the adaptive integration of information from different visual algorithms.

This framework takes advantage of the richness of visual information by adaptively considering a variety of visual properties such as color, depth, motion, and shape. Using a probabilistic approach and uncertainty metrics, the resulting framework makes appropriate decisions about the most relevant visual attributes to consider.
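
As an illustration only, one way an uncertainty metric could drive the choice of visual attributes is to rank cues by the entropy of the distribution each one produces. The abstract does not say which metrics the thesis uses, so Shannon entropy, the function names, and the example numbers below are all assumptions made for this sketch.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def rank_cues(cue_posteriors):
    """Return cue names sorted from least to most uncertain.

    `cue_posteriors` maps a cue name to that cue's distribution over the
    same discrete set of hypotheses. Using entropy as the uncertainty
    metric is an assumption of this sketch, not a detail from the thesis.
    """
    return sorted(cue_posteriors, key=lambda name: entropy(cue_posteriors[name]))


# Hypothetical distributions over three candidate target positions.
cue_posteriors = {
    "color": [0.90, 0.05, 0.05],    # sharply peaked: low uncertainty
    "depth": [0.50, 0.30, 0.20],    # moderately informative
    "motion": [1 / 3, 1 / 3, 1 / 3],  # uniform: maximally uncertain
}
ranking = rank_cues(cue_posteriors)
```

In this toy setting a sharply peaked cue ranks ahead of a flat one, which matches the intuition that a cue carrying little information about the current scene should receive less attention.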

The framework is based on an agent paradigm. Each visual algorithm is implemented as an agent that adapts its behavior according to uncertainty considerations. These agents act as a group of experts, where each agent has a specific knowledge area. Cooperation among the agents is given by a probabilistic scheme that uses Bayesian inference to integrate the evidential information provided by them.
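
The Bayesian integration of evidence from several agents can be sketched over a small discrete hypothesis set. This is a minimal sketch, not the scheme used in the thesis: it assumes the agents' observations are conditionally independent given the hypothesis, and the `fuse` function, agent names, and likelihood values are hypothetical.

```python
def fuse(prior, agent_likelihoods):
    """Combine a prior with per-agent likelihoods using Bayes' rule.

    prior: list of P(h) over the hypotheses.
    agent_likelihoods: one list per agent, giving P(evidence_i | h).
    Assumes (for this sketch) that agents are conditionally independent
    given the hypothesis, so their likelihoods simply multiply.
    """
    posterior = list(prior)
    for likelihood in agent_likelihoods:
        posterior = [p * l for p, l in zip(posterior, likelihood)]
    total = sum(posterior)
    if total == 0.0:
        raise ValueError("all hypotheses have zero probability")
    return [p / total for p in posterior]


# Two hypotheses: "target is in region A" vs. "target is in region B".
prior = [0.5, 0.5]
color_agent = [0.8, 0.2]  # hypothetical P(color evidence | hypothesis)
depth_agent = [0.6, 0.4]  # hypothetical P(depth evidence | hypothesis)
posterior = fuse(prior, [color_agent, depth_agent])
```

Here both agents favor the first hypothesis, so the fused posterior is more confident than either agent alone.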

To deal with the inherent nonlinearity of visual information, the relevant probability distributions are represented using a stochastic sampling approach. The estimation of the state of relevant visual structures is performed using an enhanced version of the particle filter algorithm. This enhanced version includes novel methods to adaptively select the number of samples used by the filter, and to adaptively find a suitable function to propagate the samples.
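
For readers unfamiliar with particle filters, the following is a minimal sketch of one step of a standard bootstrap particle filter for a one-dimensional state, with a simple rule for changing the number of samples. The adaptation rule shown (doubling or halving based on the effective sample size) is a common heuristic substituted here for illustration; it is not the method developed in the thesis, and the adaptive propagation function is not shown. The random-walk motion model, Gaussian observation model, and all parameter values are assumptions.

```python
import math
import random


def effective_sample_size(weights):
    """Effective sample size of a set of normalized weights."""
    return 1.0 / sum(w * w for w in weights)


def step(particles, observation, rng, motion_std=0.5, obs_std=1.0,
         min_n=50, max_n=2000):
    """One predict/update/resample step with a heuristic sample-count rule."""
    # Predict: propagate each sample with an assumed random-walk model.
    predicted = [x + rng.gauss(0.0, motion_std) for x in particles]

    # Update: weight each sample by an assumed Gaussian observation model.
    weights = [math.exp(-0.5 * ((observation - x) / obs_std) ** 2)
               for x in predicted]
    total = sum(weights)
    if total == 0.0:
        # All weights underflowed; fall back to uniform weights.
        weights = [1.0 / len(predicted)] * len(predicted)
    else:
        weights = [w / total for w in weights]

    # Adapt: grow the sample set when the weights are degenerate and
    # shrink it when they are nearly uniform (heuristic, not the thesis's).
    n = len(predicted)
    ess = effective_sample_size(weights)
    if ess < 0.5 * n:
        n_next = min(max_n, 2 * n)
    elif ess > 0.9 * n:
        n_next = max(min_n, n // 2)
    else:
        n_next = n

    # Resample with replacement in proportion to the weights.
    resampled = rng.choices(predicted, weights=weights, k=n_next)
    return resampled, ess


rng = random.Random(0)
particles = [rng.uniform(-10.0, 10.0) for _ in range(200)]
for z in [1.0, 1.2, 0.9, 1.1]:  # hypothetical noisy observations
    particles, ess = step(particles, z, rng)
estimate = sum(particles) / len(particles)
```

After a few observations near 1.0, the sample mean `estimate` settles close to that value, and the sample count stays within the configured bounds.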

The computational framework is implemented using a distributed multi-agent software architecture, which is tested for the case of visual target tracking using a mobile platform. In evaluations using computer simulations and real situations, the implementation compares favorably with current state-of-the-art visual target tracking techniques.

Notes

Associated center: VASC
Associated labs/groups: Advanced Mechatronics Lab and Tele-Supervised Autonomous Robotics
Associated project: Wide Area Prospecting Using Supervised Autonomous Robots

Text Reference

A. Soto, A Probabilistic Approach for the Adaptive Integration of Multiple Visual Cues Using an Agent Framework, doctoral dissertation, tech. report CMU-RI-TR-02-30, Robotics Institute, Carnegie Mellon University, October, 2002.

BibTeX Reference

@phdthesis{Soto_2002_4115,
   author = "Alvaro Soto",
   title = "A Probabilistic Approach for the Adaptive Integration of Multiple Visual Cues Using an Agent Framework",
   school = "Robotics Institute, Carnegie Mellon University",
   month = "October",
   year = "2002",
   address = "Pittsburgh, PA"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.