Carnegie Mellon University
Improving Spatial Support for Objects via Multiple Segmentations

Tomasz Malisiewicz and Alexei A. Efros
British Machine Vision Conference (BMVC), September, 2007.

  • Adobe portable document format (pdf) (2MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Sliding window scanning is the dominant paradigm in object recognition research today. But while much success has been reported in detecting several rectangular-shaped object classes (i.e. faces, cars, pedestrians), results have been much less impressive for more general types of objects. Several researchers have advocated the use of image segmentation as a way to get a better spatial support for objects. In this paper, our aim is to address this issue by studying the following two questions: 1) how important is good spatial support for recognition? 2) can segmentation provide better spatial support for objects? To answer the first, we compare recognition performance using ground-truth segmentation vs. bounding boxes. To answer the second, we use the multiple segmentation approach to evaluate how close can real segments approach the ground-truth for real objects, and at what cost. Our results demonstrate the importance of finding the right spatial support for objects, and the feasibility of doing so without excessive computational burden.

segmentation, object recognition

Number of pages: 10

Text Reference
Tomasz Malisiewicz and Alexei A. Efros, "Improving Spatial Support for Objects via Multiple Segmentations," British Machine Vision Conference (BMVC), September, 2007.

BibTeX Reference
   author = "Tomasz Malisiewicz and Alexei A. Efros",
   title = "Improving Spatial Support for Objects via Multiple Segmentations",
   booktitle = "British Machine Vision Conference (BMVC)",
   month = "September",
   year = "2007",