/Enabling Learning From Large Datasets: Applying Active Learning to Mobile Robotics

Enabling Learning From Large Datasets: Applying Active Learning to Mobile Robotics

Cristian Dima, Martial Hebert and Anthony (Tony) Stentz
Conference Paper, International Conference on Robotics and Automation, Vol. 1, pp. 108 - 114, April, 2004

Download Publication (PDF)

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.


Autonomous navigation in outdoor, off-road environments requires solving complex classification problems. Obstacle detection, road following and terrain classification are examples of tasks which have been successfully approached using supervised machine learning techniques for classification. Large amounts of training data are usually necessary in order to achieve satisfactory generalization. In such cases, manually labeling data becomes an expensive and tedious process. This paper describes a method for reducing the amount of data that needs to be presented to a human trainer. The algorithm relies on kernel density estimation in order to identify “interesting” scenes in a dataset. Our method does not require any interaction with a human expert for selecting the images, and only minimal amounts of tuning are necessary. We demonstrate its effectiveness in several experiments using data collected with two different vehicles. We first show that our method automatically selects those scenes from a large dataset that a person would consider “important” for classification tasks. Secondly, we show that by labeling only few of the images selected by our method, we obtain classification performance that is comparable to the one reached after labeling hundreds of images from the same dataset.

BibTeX Reference
author = {Cristian Dima and Martial Hebert and Anthony (Tony) Stentz},
title = {Enabling Learning From Large Datasets: Applying Active Learning to Mobile Robotics},
booktitle = {International Conference on Robotics and Automation},
year = {2004},
month = {April},
volume = {1},
pages = {108 - 114},
publisher = {IEEE},
keywords = {outdoor mobile robotics, outdoor obstacle detection, machine learning, active learning},