Learning by Demonstration with Critique from a Human Teacher

Brenna Argall, Brett Browning, and Manuela Veloso

Conference Paper, Proceedings of ACM/IEEE International Conference on Human-Robot Interaction (HRI '07), pp. 57 - 64, March, 2007

View Publication

Abstract

Learning by demonstration can be a powerful and natural tool for developing robot control policies. That is, instead of tedious hand-coding, a robot may learn a control policy by interacting with a teacher. In this work we present an algorithm for learning by demonstration in which the teacher operates in two phases. The teacher first demonstrates the task to the learner. The teacher next critiques learner performance of the task. This critique is used by the learner to update its control policy. In our implementation we utilize a 1-Nearest Neighbor technique which incorporates both training dataset and teacher critique. Since the teacher critiques performance only, they do not need to guess at an effective critique for the underlying algorithm. We argue that this method is particularly well-suited to human teachers, who are generally better at assigning credit to performances than to algorithms. We have applied this algorithm to the simulated task of a robot intercepting a ball. Our results demonstrate improved performance with teacher critiquing, where performance is measured by both execution success and efficiency.

BibTeX

@conference{Argall-2007-9680,
author = {Brenna Argall and Brett Browning and Manuela Veloso},
title = {Learning by Demonstration with Critique from a Human Teacher},
booktitle = {Proceedings of ACM/IEEE International Conference on Human-Robot Interaction (HRI '07)},
year = {2007},
month = {March},
pages = {57 - 64},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.