Automatic weight learning for multiple data sources when learning from demonstration

Brenna Argall, Brett Browning, and Manuela Veloso

Conference Paper, Proceedings of (ICRA) International Conference on Robotics and Automation, pp. 226 - 231, May, 2009

View Publication

Abstract

Traditional approaches to programming robots are generally inaccessible to non-robotics-experts. A promising exception is the Learning from Demonstration paradigm. Here a policy mapping world observations to action selection is learned, by generalizing from task demonstrations by a teacher. Most Learning from Demonstration work to date considers data from a single teacher. In this paper, we consider the incorporation of demonstrations from multiple teachers. In particular, we contribute an algorithm that handles multiple data sources, and additionally reasons about reliability differences between them. For example, multiple teachers could be inequally proficient at performing the demonstrated task. We introduce Demonstration Weight Learning (DWL) as a Learning from Demonstration algorithm that explicitly represents multiple data sources and learns to select between them, based on their observed reliability and according to an adaptive expert learning inspired approach. We present a first implementation of DWL within a simulated robotic domain. Data sources are shown to differ in reliability, and weighting is found impact task execution success. Fur- thermore, DWL is shown to produce appropriate data source weights that improve policy performance.

BibTeX

@conference{Argall-2009-17075,
author = {Brenna Argall and Brett Browning and Manuela Veloso},
title = {Automatic weight learning for multiple data sources when learning from demonstration},
booktitle = {Proceedings of (ICRA) International Conference on Robotics and Automation},
year = {2009},
month = {May},
pages = {226 - 231},
keywords = {learning skills, robot learning, automatic weight learning},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.