The MOPED framework: Object recognition and pose estimation for manipulation

Alvaro Collet Romea, Manuel Martinez Torres, and Siddhartha Srinivasa
International Journal of Robotics Research, Vol. 30, No. 10, September, 2011, pp. 1284 - 1306.


Download
  • Adobe portable document format (pdf) (3MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
We present MOPED, a framework for Multiple Object Pose Estimation and Detection that seamlessly integrates single-image and multi-image object recognition and pose estimation in one ptimized, robust, and scalable framework. We address two main challenges in computer vision for robotics: robust performance in complex scenes, and low latency for real-time operation. We achieve robust performance with Iterative Clustering Estimation (ICE), a novel algorithm that iteratively combines feature clustering with robust pose estimation. Feature clustering quickly partitions the scene and produces object hypotheses. The hypotheses are used to further refine the feature clusters, and the two steps iterate until convergence. ICE is easy to parallelize, and easily integrates single- and multi-camera object recognition and pose estimation. We also introduce a novel object hypothesis scoring function based on M-estimator theory, and a novel pose clustering algorithm that robustly handles recognition outliers. We achieve scalability and low latency with an improved feature matching algorithm for large databases, a GPU/CPU hybrid architecture that exploits parallelism at all levels, and an optimized resource scheduler. We provide extensive experimental results demonstrating state-of-the-art performance in terms of recognition, scalability, and latency in real-world robotic applications.

Notes
Number of pages: 23

Text Reference
Alvaro Collet Romea, Manuel Martinez Torres, and Siddhartha Srinivasa, "The MOPED framework: Object recognition and pose estimation for manipulation," International Journal of Robotics Research, Vol. 30, No. 10, September, 2011, pp. 1284 - 1306.

BibTeX Reference
@article{Collet_Romea_2011_6856,
   author = "Alvaro {Collet Romea} and Manuel {Martinez Torres} and Siddhartha Srinivasa",
   title = "The MOPED framework: Object recognition and pose estimation for manipulation",
   journal = "International Journal of Robotics Research",
   pages = "1284 - 1306",
   month = "September",
   year = "2011",
   volume = "30",
   number = "10",
}