Application of a model of instrumental conditioning to mobile robot control - Robotics Institute Carnegie Mellon University

Application of a model of instrumental conditioning to mobile robot control

Conference Paper, Proceedings of SPIE Sensor Fusion and Decentralized Control in Autonomous Robotic Systems, Vol. 3209, pp. 55 - 66, October, 1997

Abstract

Instrumental conditioning is a psychological process whereby an animal learns to associate its actions with their consequences. This type of learning is exploited in animal training techniques such as 'shaping by successive approximations,' which enables trainers to gradually adjust the animal's behavior by giving strategically timed reinforcements. While this is similar in principle to reinforcement learning, the real phenomenon includes many subtle effects not considered in the machine learning literature. In addition, a good deal of domain information is utilized by an animal learning a new task; it does not start from scratch every time it learns a new behavior. For these reasons, it is not surprising that mobile robot learning algorithms have yet to approach the sophistication and robustness of animal learning. A serious attempt to model instrumental learning could prove fruitful for improving machine learning techniques. In the present paper, we develop a computational theory of shaping at a level appropriate for controlling mobile robots. The theory is based on a series of mechanisms for 'behavior editing,' in which pre-existing behaviors, either innate or previously learned, can be dramatically changed in magnitude, shifted in direction, or otherwise manipulated so as to produce new behavioral routines. We have implemented our theory on Amelia, an RWI B21 mobile robot equipped with a gripper and color video camera. We provide results from training Amelia on several tasks, all of which were constructed as variations of one innate behavior, object-pursuit.

BibTeX

@conference{Saksida-1997-16384,
author = {Lisa Saksida and David S. Touretzky},
title = {Application of a model of instrumental conditioning to mobile robot control},
booktitle = {Proceedings of SPIE Sensor Fusion and Decentralized Control in Autonomous Robotic Systems},
year = {1997},
month = {October},
editor = {Paul S. Schenker and Gerard T. McKee},
volume = {3209},
pages = {55 - 66},
}