The Robotics Institute
Search the site
RI | Publications | Monte Carlo EM for Data-Association and its Applications in Computer Vision

Text only version of this site

Monte Carlo EM for Data-Association and its Applications in Computer Vision
F. Dellaert
doctoral dissertation, tech. report CMU-CS-01-153, Computer Science Department, Carnegie Mellon University, September, 2001.

Jump to: Download | Abstract | Notes | Text Reference | BibTeX Reference

Download [Help]

Adobe portable document format (pdf) [5326 KB]
Compressed postscript (ps.gz) [10028 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

Estimating geometry from images is at the core of many computer vision applications, whether it concerns the imaging geometry, the geometry of the scene, or both. Examples include image mosaicking, pose estimation, multi-baseline stereo, and structure from motion. All these problems can be modeled probabilistically and translate into well-understood statistical estimation problems, provided the correspondence between measurements in the different images is known.

I will show that, if the correspondence is not known, the statistically optimal estimate for the geometry can be obtained using the expectation-maximization (EM) algorithm. In contrast to existing techniques, the EM algorithm avoids the estimation bias associated with computing a single "best" set of correspondences, but rather considers the distribution over all possible correspondences consistent with the data. While the latter computation is intractable in general, I show that it can be approximated well in practice using Markov chain Monte Carlo sampling. As part of this, I have designed an efficient sampler specifically tuned to the correspondence problem.

Abstract The resulting Monte Carlo EM approach represents the first truly multi-view algorithm for geometric estimation with unknown correspondence. This is especially relevant in the structure from motion domain, where the state of the art relies on robust estimation of two or three-view geometric constraints. In addition, I will show that the probabilistic approach I propose allows for a seamless and principled way of integrating prior knowledge, appearance models, and statistical models for occlusion and clutter.

Notes

Number of pages: 236

Text Reference

F. Dellaert, Monte Carlo EM for Data-Association and its Applications in Computer Vision, doctoral dissertation, tech. report CMU-CS-01-153, Computer Science Department, Carnegie Mellon University, September, 2001.

BibTeX Reference

@phdthesis{Dellaert_2001_3851,
   author = "Frank Dellaert",
   title = "Monte Carlo EM for Data-Association and its Applications in Computer Vision",
   school = "Computer Science Department, Carnegie Mellon University",
   month = "September",
   year = "2001",
   address = "Pittsburgh, PA"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
For updates and comments, please see these instructions.
This page maintained by robotwebmaster@ri.cmu.edu