Reinforcement Learning with Probabilistically Safe Control Barrier Functions for Ramp Merging

Soumith Udatha, Yiwei Lyu, and John M. Dolan

Conference Paper, Proceedings of (ICRA) International Conference on Robotics and Automation, pp. 5625-5630, May, 2023

Abstract

Prior work has applied reinforcement learning (RL) to autonomous driving scenarios, but the safety of the algorithm is often compromised by instability or ill-defined reward functions. By embedding control barrier functions into the RL policy, we obtain safe policies that optimize the performance of the autonomous vehicle, and the safety layer on top of the RL method eases the design of reward functions. However, control barrier functions need a good approximation of the system model. We use probabilistic control barrier functions [4] to account for model uncertainty. Our Safety-Assured Policy Optimization - Ramp Merging (SAPO-RM) algorithm is implemented online in the CARLA [1] simulator and offline on the US I-80 dataset extracted from the NGSIM database provided by NHTSA [2]. We further test the algorithm and perform ablation studies on the US-101 and exi-D datasets to compare the approaches. The proposed algorithm can also be applied to other driving scenarios by changing the reward and safety constraints.
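
To make the idea of a safety layer over an RL action concrete, here is a minimal sketch of a control-barrier-function filter for one-dimensional car following. It is not the SAPO-RM formulation. The barrier (a time-headway gap), the Gaussian model of the lead vehicle's speed, the assumption that the lead vehicle holds its speed, and every name and default value below are assumptions chosen for illustration.

```python
from statistics import NormalDist


def safe_acceleration(u_rl, gap, v_ego, v_lead_mean, v_lead_std,
                      tau=1.5, d_min=5.0, alpha=1.0, eps=0.05,
                      u_min=-6.0, u_max=3.0):
    """Filter an RL acceleration command through a chance-constrained CBF.

    Hypothetical setup (not taken from the paper):
      barrier   h = gap - tau * v_ego - d_min   (time-headway safety margin)
      dynamics  d(gap)/dt = v_lead - v_ego,  d(v_ego)/dt = u
      condition dh/dt + alpha * h >= 0
    The lead speed is modelled as Gaussian, and the condition is tightened
    so that it holds with probability at least 1 - eps under that model.
    """
    h = gap - tau * v_ego - d_min

    # Pessimistic lead speed: the (eps)-quantile of the Gaussian estimate.
    z = NormalDist().inv_cdf(1.0 - eps)
    v_lead_lower = v_lead_mean - z * v_lead_std

    # dh/dt = (v_lead - v_ego) - tau * u, so the condition is an upper bound on u.
    u_cbf = ((v_lead_lower - v_ego) + alpha * h) / tau

    # With a scalar input and one upper bound, the usual CBF quadratic program
    # (stay as close to u_rl as possible) reduces to a simple minimum.
    u = min(u_rl, u_cbf)

    # Actuator limits. If u_cbf < u_min the constraint cannot be met and the
    # filter falls back to maximum braking as a best effort.
    return max(u_min, min(u_max, u))


# Large gap: the RL command passes through unchanged.
u_free = safe_acceleration(2.0, gap=50.0, v_ego=20.0,
                           v_lead_mean=20.0, v_lead_std=0.0)

# Small gap and an uncertain, slower lead: the filter overrides with braking.
u_tight = safe_acceleration(2.0, gap=36.0, v_ego=20.0,
                            v_lead_mean=18.0, v_lead_std=1.0)
```

In this sketch, a larger `v_lead_std` or a smaller `eps` tightens the bound, which is one simple way model uncertainty can make a safety layer more conservative.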

BibTeX

@conference{Udatha-2023-139372,
author = {Soumith Udatha and Yiwei Lyu and John M. Dolan},
title = {Reinforcement Learning with Probabilistically Safe Control Barrier Functions for Ramp Merging},
booktitle = {Proceedings of (ICRA) International Conference on Robotics and Automation},
year = {2023},
month = {May},
pages = {5625--5630},
keywords = {autonomous driving, reinforcement learning, Control Barrier Functions},
}
