/Structured Apprenticeship Learning

Structured Apprenticeship Learning

Abdeslam Boularias, Oliver Kroemer and Jan Peters
Conference Paper, European Conference on Machine Learning (ECML), January, 2012

Download Publication (PDF)

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.


We propose a graph-based algorithm for apprenticeship learning when the reward features are noisy. Previous apprenticeship learning techniques learn a reward function by using only local state features. This can be a limitation in practice, as often some features are misspecified or subject to measurement noise. Our graphical framework, inspired from the work on Markov Random Fields, allows to alleviate this problem by propagating information between states, and rewarding policies that choose similar actions in adjacent states. We demonstrate the advantage of the proposed approach on grid-world navigation problems, and on the problem of teaching a robot to grasp novel objects in simulation.

BibTeX Reference
author = {Abdeslam Boularias and Oliver Kroemer and Jan Peters},
title = {Structured Apprenticeship Learning},
booktitle = {European Conference on Machine Learning (ECML)},
year = {2012},
month = {January},