Putting Objects in Perspective

Derek Hoiem, Alexei A. Efros, and Martial Hebert
International Journal of Computer Vision, Vol. 80, No. 1, October, 2008


Download
  • Adobe portable document format (pdf) (7MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
Image understanding requires not only individually estimating elements of the visual world but also capturing the interplay among them. In this paper, we provide a framework for placing local object detection in the context of the overall 3D scene by modeling the interdependence of objects, surface orientations, and camera viewpoint. Most object detection methods consider all scales and locations in the image as equally likely. We show that with probabilistic estimates of 3D geometry, both in terms of surfaces and world coordinates, we can put objects into perspective and model the scale and location variance in the image. Our approach reflects the cyclical nature of the problem by allowing probabilistic object hypotheses to refine geometry and vice-versa. Our framework allows painless substitution of almost any object detector and is easily extended to include other aspects of image understanding. Our results confirm the benefits of our integrated approach.

Notes
Number of pages: 17

Text Reference
Derek Hoiem, Alexei A. Efros, and Martial Hebert, "Putting Objects in Perspective," International Journal of Computer Vision, Vol. 80, No. 1, October, 2008

BibTeX Reference
@article{Hoiem_2008_6352,
   author = "Derek Hoiem and Alexei A. Efros and Martial Hebert",
   title = "Putting Objects in Perspective",
   journal = "International Journal of Computer Vision",
   month = "October",
   year = "2008",
   volume = "80",
   number = "1",
}