skip to main content
10.1145/3583780.3615454acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving

Published:21 October 2023Publication History

ABSTRACT

In this paper, we focus on the decision-making challenge in autonomous driving, a central and intricate problem influencing the safety and practicality of autonomous vehicles. We propose an innovative hierarchical imitation learning framework that effectively alleviates the complexity of learning in autonomous driving decision-making problems by decoupling decision-making tasks into sub-problems. Specifically, the decision-making process is divided into two levels of sub-problems: the upper level directs the vehicle's lane selection and qualitative speed management, while the lower level implements precise control of the driving speed and direction. We harness Transformer-based models for solving each sub-problem, enabling overall hierarchical framework to comprehend and navigate diverse and various road conditions, ultimately resulting in improved decision-making. Through an evaluation in several typical driving scenarios within the SMARTS autonomous driving simulation environment, our proposed hierarchical decision-making framework significantly outperforms end-to-end reinforcement learning algorithms and behavior cloning algorithm, achieving an average pass rate of over 90%. Our framework's effectiveness is substantiated by its commendable achievements at the NeurIPS 2022 Driving SMARTS competition, where it secures dual track championships.

References

  1. Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal, Lawrence D. Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, Xin Zhang, Jake Zhao, and Karol Zieba. 2016. End to End Learning for Self-Driving Cars. arxiv: 1604.07316 [cs.CV]Google ScholarGoogle Scholar
  2. Mariusz Bojarski, Philip Yeres, Anna Choromanska, Krzysztof Choromanski, Bernhard Firner, Lawrence Jackel, and Urs Muller. 2017. Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car. arxiv: 1704.07911 [cs.CV]Google ScholarGoogle Scholar
  3. Felipe Codevilla, Matthias Müller, Antonio López, Vladlen Koltun, and Alexey Dosovitskiy. 2018. End-to-end driving via conditional imitation learning. In 2018 IEEE international conference on robotics and automation (ICRA). IEEE, 4693--4700.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Serdar Coskun and Reza Langari. 2018. Predictive Fuzzy Markov Decision Strategy for Autonomous Driving in Highways. In 2018 IEEE Conference on Control Technology and Applications (CCTA). 1032--1039. https://doi.org/10.1109/CCTA.2018.8511369Google ScholarGoogle Scholar
  5. Christopher Diehl, Timo Sievernich, Martin Krüger, Frank Hoffmann, and Torsten Bertram. 2022. UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning. arxiv: 2111.11097 [cs.RO]Google ScholarGoogle Scholar
  6. Andreas Folkers, Matthias Rick, and Christof Büskens. 2019. Controlling an Autonomous Vehicle with Deep Reinforcement Learning. In 2019 IEEE Intelligent Vehicles Symposium (IV). 2025--2031. https://doi.org/10.1109/IVS.2019.8814124Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning. PMLR, 1861--1870.Google ScholarGoogle Scholar
  8. Anthony Hu, Gianluca Corrado, Nicolas Griffiths, Zachary Murez, Corina Gurau, Hudson Yeo, Alex Kendall, Roberto Cipolla, and Jamie Shotton. 2022. Model-Based Imitation Learning for Urban Driving. In Advances in Neural Information Processing Systems, S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh (Eds.), Vol. 35. Curran Associates, Inc., 20703--20716. https://proceedings.neurips.cc/paper_files/paper/2022/file/827cb489449ea216e4a257c47e407d18-Paper-Conference.pdfGoogle ScholarGoogle Scholar
  9. Zhiyu Huang, Haochen Liu, Jingda Wu, Wenhui Huang, and Chen Lv. 2023. Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving. arXiv preprint arXiv:2302.03939 (2023).Google ScholarGoogle Scholar
  10. Aviral Kumar, Aurick Zhou, George Tucker, and Sergey Levine. 2020. Conservative Q-Learning for Offline Reinforcement Learning. arxiv: 2006.04779 [cs.LG]Google ScholarGoogle Scholar
  11. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. nature, Vol. 518, 7540 (2015), 529--533.Google ScholarGoogle Scholar
  12. Tung Nguyen, Qinqing Zheng, and Aditya Grover. 2023. Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning. arxiv: 2210.05158 [cs.LG]Google ScholarGoogle Scholar
  13. Samyeul Noh. 2019. Decision-Making Framework for Autonomous Driving at Road Intersections: Safeguarding Against Collision, Overly Conservative Behavior, and Violation Vehicles. IEEE Transactions on Industrial Electronics, Vol. 66, 4 (2019), 3275--3286. https://doi.org/10.1109/TIE.2018.2840530Google ScholarGoogle ScholarCross RefCross Ref
  14. Alexandre Piche, Rafael Pardinas, David Vazquez, Igor Mordatch, and Chris Pal. 2022. Implicit Offline Reinforcement Learning via Supervised Learning. arXiv preprint arXiv:2210.12272 (2022).Google ScholarGoogle Scholar
  15. Zhiqian Qiao, Katharina Muelling, John Dolan, Praveen Palanisamy, and Priyantha Mudalige. 2018. POMDP and Hierarchical Options MDP with Continuous Actions for Autonomous Driving at Intersections. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC). 2377--2382. https://doi.org/10.1109/ITSC.2018.8569400Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Amir Rasouli, Randy Goebel, Matthew E Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, et al. 2022. NeurIPS 2022 Competition: Driving SMARTS. arXiv preprint arXiv:2211.07545 (2022).Google ScholarGoogle Scholar
  17. Moveh Samuel, Mohamed Hussein, and Maziah Binti Mohamad. 2016. A review of some pure-pursuit based path tracking techniques for control of autonomous vehicle. International Journal of Computer Applications, Vol. 135, 1 (2016), 35--38.Google ScholarGoogle ScholarCross RefCross Ref
  18. Hao Shao, Letian Wang, Ruobing Chen, Hongsheng Li, and Yu Liu. 2023. Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer. In Proceedings of The 6th Conference on Robot Learning (Proceedings of Machine Learning Research, Vol. 205), Karen Liu, Dana Kulic, and Jeff Ichnowski (Eds.). PMLR, 726--737. https://proceedings.mlr.press/v205/shao23a.htmlGoogle ScholarGoogle Scholar
  19. Yuan Tian. 2022. SMARTS_VCR. https://github.com/yuant95/SMARTS_VCR.Google ScholarGoogle Scholar
  20. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. arxiv: 1706.03762 [cs.CL]Google ScholarGoogle Scholar
  21. Pin Wang and Ching-Yao Chan. 2017. Formulation of deep reinforcement learning architecture toward autonomous driving for on-ramp merge. In 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC). 1--6. https://doi.org/10.1109/ITSC.2017.8317735Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Huazhe Xu, Yang Gao, Fisher Yu, and Trevor Darrell. 2017. End-To-End Learning of Driving Models From Large-Scale Video Datasets. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle ScholarCross RefCross Ref
  23. Pan Zhao, Jiajia Chen, Yan Song, Xiang Tao, Tiejuan Xu, and Tao Mei. 2012. Design of a control system for an autonomous vehicle based on adaptive-pid. International Journal of Advanced Robotic Systems, Vol. 9, 2 (2012), 44.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, et al. 2020. Smarts: Scalable multi-agent reinforcement learning training school for autonomous driving. arXiv preprint arXiv:2010.09776 (2020).Google ScholarGoogle Scholar

Index Terms

  1. A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
        October 2023
        5508 pages
        ISBN:9798400701245
        DOI:10.1145/3583780

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 October 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      • Article Metrics

        • Downloads (Last 12 months)189
        • Downloads (Last 6 weeks)25

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader