Carnegie Mellon Robotics Institute
Poj Tangamchit, John M. Dolan, and Pradeep Khosla
IEEE Conference on Robotics and Automation 2002, May, 2002.
| Download |
|
| Abstract |
| Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular single-robot learning algorithms based on discounted rewards, such as Q learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A task-level system is defined as one performing a mission that is decomposed into subtasks shared among robots. In this paper, we demonstrate the superiority of average-reward-based learning such as the Monte Carlo algorithm for task-level multirobot systems, and suggest an explanation for this superiority. |
| Keywords |
| Cooperation, Multirobot, Learning |
| Notes |
| Text Reference |
| Poj Tangamchit, John M. Dolan, and Pradeep Khosla, "The Necessity of Average Rewards in Cooperative Multirobot Learning," IEEE Conference on Robotics and Automation 2002, May, 2002. |
| BibTeX Reference |
|
@inproceedings{Tangamchit_2002_3974, author = "Poj Tangamchit and John M Dolan and Pradeep Khosla", title = "The Necessity of Average Rewards in Cooperative Multirobot Learning", booktitle = "IEEE Conference on Robotics and Automation 2002", month = "May", year = "2002", } |
| The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University. Contact Us | Update Instructions |