Conferences >2021 IEEE International Intel...

Adaptive Reward for CAV Action Planning using Monte Carlo Tree Search

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Cooperative action planning for Connected and Autonomous Vehicles (CAVs) in an emergency scenario is an important task in the autonomous driving domain. Reinforcement lea...Show More

Metadata

Abstract:

Cooperative action planning for Connected and Autonomous Vehicles (CAVs) in an emergency scenario is an important task in the autonomous driving domain. Reinforcement learning algorithms such as Monte Carlo Tree Search (MCTS) have popularly been used to solve this problem with some success. MCTS rely on performing many simulations of CAV actions to learn expected reward values for CAV actions. A refined reward function design is a necessary precondition for better success rates in MCTS. Traditionally, predefined reward functions with fixed reward parameters are used in all CAVs scenarios by most MCTS-based algorithms. This paper presents a novel Monte Carlo Tree Search (MCTS) based algorithm that dynamically modifies the reward function parameters to encourage or discourage particular CAV actions. Our proposed algorithm with a dynamic reward function significantly improves the reliability of MCTS having a fixed reward function. We evaluate the proposed algorithm in a large-scale multi-agent-based traffic simulation system. Experimental results show that our algorithm significantly improves upon current state-of-the-art centralized and decentralized algorithms.

Published in: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC)

Date of Conference: 19-22 September 2021

Date Added to IEEE Xplore: 25 October 2021

ISBN Information:

DOI: 10.1109/ITSC48978.2021.9564688

Conference Location: Indianapolis, IN, USA