Stochastic pedestrian avoidance for autonomous vehicles using hybrid reinforcement learning

Li, Huiqian; Huang, Jin; Cao, Zhong; Yang, Diange; Zhong, Zhihua

doi:10.1631/FITEE.2200128

Stochastic pedestrian avoidance for autonomous vehicles using hybrid reinforcement learning

基于混合强化学习的自动驾驶汽车行人避撞方法

Published: 23 January 2023

Volume 24, pages 131–140, (2023)
Cite this article

Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Huiqian Li (李惠乾) ORCID: orcid.org/0000-0003-4949-8834¹,
Jin Huang (黄晋) ORCID: orcid.org/0000-0001-8774-2936¹,
Zhong Cao (曹重)¹,
Diange Yang (杨殿阁)¹ &
…
Zhihua Zhong (钟志华)²

308 Accesses
4 Citations
Explore all metrics

Abstract

Ensuring the safety of pedestrians is essential and challenging when autonomous vehicles are involved. Classical pedestrian avoidance strategies cannot handle uncertainty, and learning-based methods lack performance guarantees. In this paper we propose a hybrid reinforcement learning (HRL) approach for autonomous vehicles to safely interact with pedestrians behaving uncertainly. The method integrates the rule-based strategy and reinforcement learning strategy. The confidence of both strategies is evaluated using the data recorded in the training process. Then we design an activation function to select the final policy with higher confidence. In this way, we can guarantee that the final policy performance is not worse than that of the rule-based policy. To demonstrate the effectiveness of the proposed method, we validate it in simulation using an accelerated testing technique to generate stochastic pedestrians. The results indicate that it increases the success rate for pedestrian avoidance to 98.8%, compared with 94.4% of the baseline method.

摘要

确保行人的安全对自动驾驶汽车而言至关重要，同时也具有一定挑战。经典的行人避撞策略无法应对不确定性，而基于学习的方法缺乏明确的性能保障。本文提出一种基于混合强化学习的行人避撞方法，以使自动驾驶车辆能够与具有行为不确定性的行人安全交互。该方法集成了规则策略和强化学习策略，并设计了一个激活函数选择具有更高置信度的作为最终策略，通过这种方式保证最终策略的表现不亚于规则策略。为说明所提方法的有效性，本文使用一种加速测试方法生成了行为随机的行人进行仿真验证。结果表明，该方法在测试场景中的成功率，相比基准方法的94.4%，提升至98.8%。

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Autonomous highway driving using reinforcement learning with safety check system based on time-to-collision

Article 26 December 2022

Xiaotong Nie, Yupeng Liang & Kazuhiro Ohkura

Optimizing pedestrian simulation based on expert trajectory guidance and deep reinforcement learning

Article 16 January 2023

Senlin Mu, Xiao Huang, … Xiang Li

Safe and Goal-Based Highway Maneuver Planning with Reinforcement Learning

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Bai HY, Cai SJ, Ye N, et al., 2015. Intention-aware online POMDP planning for autonomous driving in a crowd. IEEE Int Conf on Robotics and Automation, p.454–460. https://doi.org/10.1109/ICRA.2015.7139219
Batkovic I, Zanon M, Ali M, et al., 2019. Real-time constrained trajectory planning and vehicle control for proactive autonomous driving with road users. 18^th European Control Conf, p.256–262. https://doi.org/10.23919/ECC.2019.8796099
Bhattacharyya A, Reino DO, Fritz M, et al., 2021. Euro-PVI: pedestrian vehicle interactions in dense urban centers. Proc IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.6404–6413. https://doi.org/10.1109/CVPR46437.2021.00634
Bouton M, Nakhaei A, Fujimura K, et al., 2018. Scalable decision making with sensor occlusions for autonomous driving. IEEE Int Conf on Robotics and Automation, p.2076–2081. https://doi.org/10.1109/ICRA.2018.8460914
Cao Z, Yang DG, Xu SB, et al., 2021. Highway exiting planner for automated vehicles using reinforcement learning. IEEE Trans Intell Transp Syst, 22(2):990–1000. https://doi.org/10.1109/tits.2019.2961739
Article Google Scholar
Cao Z, Xu SB, Peng HE, et al., 2022. Confidence-aware reinforcement learning for self-driving cars. IEEE Trans Intell Transp Syst, 23(7):7419–7430. https://doi.org/10.1109/TITS.2021.3069497
Article Google Scholar
Everett M, Chen YF, How JP, 2021. Collision avoidance in pedestrian-rich environments with deep reinforcement learning. IEEE Access, 9:10357–10377. https://doi.org/10.1109/ACCESS.2021.3050338
Article Google Scholar
Feng S, Yan XT, Sun HW, et al., 2021. Intelligent driving intelligence test for autonomous vehicles with naturalistic and adversarial environment. Nat Commun, 12(1):748. https://doi.org/10.1038/s41467-021-21007-8
Article Google Scholar
García J, Fernández F, 2015. A comprehensive survey on safe reinforcement learning. J Mach Learn Res, 16(1):1437–1480.
MATH Google Scholar
Jayaraman SK, Tilbury DM, Yang XJ, et al., 2020a. Analysis and prediction of pedestrian crosswalk behavior during automated vehicle interactions. IEEE Int Conf on Robotics and Automation, p.6426–6432. https://doi.org/10.1109/icra40945.2020.9197347
Jayaraman SK, Robert LP, Yang XJ, et al., 2020b. Efficient behavior-aware control of automated vehicles at crosswalks using minimal information pedestrian prediction model. American Control Conf, p.4362–4368. https://doi.org/10.23919/ACC45564.2020.9147248
Kapania NR, Govindarajan V, Borrelli F, et al., 2019. A hybrid control design for autonomous vehicles at uncontrolled crosswalks. IEEE Intelligent Vehicles Symp, p.1604–1611. https://doi.org/10.1109/IVS.2019.8814116
Koç M, Yurtsever E, Redmill K, et al., 2021. Pedestrian emergence estimation and occlusion-aware risk assessment for urban autonomous driving. IEEE Conf on Intelligent Transportation Systems, p.292–297. https://doi.org/10.1109/ITSC48978.2021.9565071
Li ZR, Gong JW, Lu C, et al., 2020. Importance weighted Gaussian process regression for transferable driver behaviour learning in the lane change scenario. IEEE Trans Veh Technol, 69(11):12497–12509. https://doi.org/10.1109/TVT.2020.3021752
Article Google Scholar
Li ZR, Gong JW, Lu C, et al., 2021. Interactive behavior prediction for heterogeneous traffic participants in the urban road: a graph-neural-network-based multitask learning framework. IEEE/ASME Trans Mechatron, 26(3):1339–1349. https://doi.org/10.1109/TMECH.2021.3073736
Article Google Scholar
Li ZR, Lu C, Yi YT, et al., 2022a. A hierarchical framework for interactive behaviour prediction of heterogeneous traffic participants based on graph neural network. IEEE Trans Intell Transp Syst, 23(7):9102–9114. https://doi.org/10.1109/TITS.2021.3090851
Article Google Scholar
Li ZR, Gong J, Lu C, et al., 2022b. Personalized driver braking behavior modeling in the car-following scenario: an importance-weight-based transfer learning approach. IEEE Trans Ind Electron, 69(10):10704–10714. https://doi.org/10.1109/TIE.2022.3146549
Article Google Scholar
Liu Q, Li XY, Yuan SH, et al., 2021. Decision-making technology for autonomous vehicles: learning-based methods, applications and future outlook. IEEE Conf on Intelligent Transportation Systems, p.30–37. https://doi.org/10.1109/ITSC48978.2021.9564580
Mnih V, Kavukcuoglu K, Silver D, et al., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540):529–533. https://doi.org/10.1038/nature14236
Article Google Scholar
National Highway Traffic Safety Administration, 2019. 2018 Fatal Motor Vehicle Crashes: Overview. Traffic Safety Facts Research Note, U.S. Department of Transportation, p.1–9.
Pusse F, Klusch M, 2019. Hybrid online POMDP planning and deep reinforcement learning for safer self-driving cars. IEEE Intelligent Vehicles Symp, p.1013–1020. https://doi.org/10.1109/IVS.2019.8814125
Rasouli A, Tsotsos JK, 2020. Autonomous vehicles that interact with pedestrians: a survey of theory and practice. IEEE Trans Intell Transp Syst, 21(3):900–918. https://doi.org/10.1109/TITS.2019.2901817
Article Google Scholar
Schratter M, Bouton M, Kochenderfer MJ, et al., 2019. Pedestrian collision avoidance system for scenarios with occlusions. IEEE Intelligent Vehicles Symp, p.1054–1060. https://doi.org/10.1109/IVS.2019.8814076
Wang XP, Peng HE, Zhao D, 2019. Combining reachability analysis and importance sampling for accelerated evaluation of highly automated vehicles at pedestrian crossing. Proc ASME Dynamic Systems and Control Conf, Article V003T18A011. https://doi.org/10.1115/DSCC2019-9179
Yang DF, Redmill K, Özgüner Ü, 2020. A multi-state social force based framework for vehicle-pedestrian interaction in uncontrolled pedestrian crossing scenarios. IEEE Intelligent Vehicles Symp, p.1807–1812. https://doi.org/10.1109/IV47402.2020.9304561
Yurtsever E, Capito L, Redmill K, et al., 2020. Integrating deep reinforcement learning with model-based path planners for automated driving. IEEE Intelligent Vehicles Symp, p.1311–1316. https://doi.org/10.1109/IV47402.2020.9304735
Zhong YX, Cao Z, Zhu MH, et al., 2020. CLAP: cloud-and-learning-compatible autonomous driving platform. IEEE Intelligent Vehicles Symp, p.1450–1456. https://doi.org/10.1109/IV47402.2020.9304828
Zhou WT, Jiang K, Cao Z, et al., 2020. Integrating deep reinforcement learning with optimal trajectory planner for automated driving. IEEE 23^rd Int Conf on Intelligent Transportation Systems, p.1–8. https://doi.org/10.1109/ITSC45102.2020.9294275

Download references

Author information

Authors and Affiliations

School of Vehicle and Mobility, Tsinghua University, Beijing, 100084, China
Huiqian Li (李惠乾), Jin Huang (黄晋), Zhong Cao (曹重) & Diange Yang (杨殿阁)
Chinese Academy of Engineering, Beijing, 100088, China
Zhihua Zhong (钟志华)

Authors

Huiqian Li (李惠乾)
View author publications
You can also search for this author in PubMed Google Scholar
Jin Huang (黄晋)
View author publications
You can also search for this author in PubMed Google Scholar
Zhong Cao (曹重)
View author publications
You can also search for this author in PubMed Google Scholar
Diange Yang (杨殿阁)
View author publications
You can also search for this author in PubMed Google Scholar
Zhihua Zhong (钟志华)
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Jin HUANG and Zhong CAO designed the research. Huiqian LI processed the data and drafted the paper. Diange YANG helped organize the paper. Huiqian LI and Zhihua ZHONG revised and finalized the paper.

Corresponding author

Correspondence to Jin Huang (黄晋).

Additional information

Compliance with ethics guidelines

Huiqian LI, Jin HUANG, Zhong CAO, Diange YANG, and Zhihua ZHONG declare that they have no conflict of interest.

Project supported by the National Natural Science Foundation of China (Nos. 61872217, U20A20285, 52122217, and U1801263) and the Key R&D Projects of the Ministry of Science and Technology of China (No. 2020YFB1710901)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, H., Huang, J., Cao, Z. et al. Stochastic pedestrian avoidance for autonomous vehicles using hybrid reinforcement learning. Front Inform Technol Electron Eng 24, 131–140 (2023). https://doi.org/10.1631/FITEE.2200128

Download citation

Received: 03 April 2022
Accepted: 10 August 2022
Published: 23 January 2023
Issue Date: January 2023
DOI: https://doi.org/10.1631/FITEE.2200128

Key words

关键词

CLC number

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Stochastic pedestrian avoidance for autonomous vehicles using hybrid reinforcement learning

Abstract

摘要

Access this article

Similar content being viewed by others

Autonomous highway driving using reinforcement learning with safety check system based on time-to-collision

Optimizing pedestrian simulation based on expert trajectory guidance and deep reinforcement learning

Safe and Goal-Based Highway Maneuver Planning with Reinforcement Learning

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Additional information

Compliance with ethics guidelines

Rights and permissions

About this article

Cite this article

Key words

关键词

CLC number

Navigation

Abstract

摘要

Access this article

Similar content being viewed by others

Autonomous highway driving using reinforcement learning with safety check system based on time-to-collision

Optimizing pedestrian simulation based on expert trajectory guidance and deep reinforcement learning

Safe and Goal-Based Highway Maneuver Planning with Reinforcement Learning

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Additional information

Compliance with ethics guidelines

Rights and permissions

About this article

Cite this article

Share this article

Key words

关键词

CLC number

Search

Navigation