skip to main content
10.1145/3639631.3639639acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacaiConference Proceedingsconference-collections
research-article

Automatic driving multi task awareness network based on improved YOLOP

Published: 16 February 2024 Publication History

Abstract

In recent years, deep learning technology has developed rapidly and is widely used in the field of autonomous vehicle driving. Multi-task networks can simultaneously complete multiple tasks in autonomous driving environment perception, and can minimize hardware costs while being highly efficient, so they are very popular in low-cost autonomous driving systems. This paper improves the problems existing in the YOLOP network and proposes an efficient multi-task autonomous driving perception network, which uses a shared encoder and three independent decoding heads to simultaneously complete road traffic target detection, drivable area detection and lane detection. Three major perception tasks of line detection. We use the SPPF module in the backbone network instead of the SPP module in YOLOP, so that the network can obtain feature information of different scales while being lightweight; secondly, in the Neck layer, we abandoned the classic structure of FPN+PANet in YOLOP and proposed the HDM structure Rich receptive field information can be extracted, and multi-scale feature fusion tasks can be cleverly completed by cascading with shallow networks. Finally, in the experiment, we found that different tasks of the multi-task network have different requirements for feature information. Each task We cannot simply use the deepest features of Neck like YOLOP, so we adjusted the network structure and input the required feature information for different tasks. The final results show that the multi-task network we designed can achieve an astonishing 97 FPS on NVIDIA TESLA V100, which is far more than the 49 FPS of the YOLOP model under the same experimental settings, ensuring that autonomous vehicles can quickly and accurately perceive the surrounding environment., which is conducive to the safe and reliable driving of autonomous vehicles.

References

[1]
Qian Y, Dolan J M, Yang M. DLT-Net: Joint detection of drivable areas, lane lines, and traffic objects[J]. IEEE Transactions on Intelligent Transportation Systems, 2019, 21(11): 4670-4679.
[2]
Teichmann M, Weber M, Zoellner M, Multinet: Real-time joint semantic reasoning for autonomous driving[C]//2018 IEEE intelligent vehicles symposium (IV). IEEE, 2018: 1013-1020.
[3]
Xu G, Khan A S, Moshayedi A J, The Object Detection, Perspective and Obstacles In Robotic: A Review[J]. EAI Endorsed Transactions on AI and Robotics, 2022, 1(1).
[4]
He K, Gkioxari G, Dollár P, Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[5]
Girshick R. Fast r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
[6]
Wu D, Liao M W, Zhang W T, Yolop: You only look once for panoptic driving perception[J]. Machine Intelligence Research, 2022, 19(6): 550-562.
[7]
Han C, Zhao Q, Zhang S, Yolopv2: Better, faster, stronger for panoptic driving perception[J]. ar**v preprint ar**v:2208.11434, 2022.
[8]
Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. ar**v preprint ar**v:2004.10934, 2020.
[9]
Yu F, Chen H, Wang X, Bdd100k: A diverse driving dataset for heterogeneous multitask learning[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 2636-2645.
[10]
Wang C Y, Bochkovskiy A, Liao H Y M. Scaled-yolov4: Scaling cross stage partial network[C]//Proceedings of the IEEE/cvf conference on computer vision and pattern recognition. 2021: 13029-13038.
[11]
Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. ar**v preprint ar**v:2004.10934, 2020.
[12]
Yu F, Koltun V. Multi-scale context aggregation by dilated convolutions[J]. ar**v preprint ar**v:1511.07122, 2015.
[13]
Lin T Y, Maire M, Belongie S, Microsoft coco: Common objects in context[C]//Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer International Publishing, 2014: 740-755.
[14]
Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. ar**v preprint ar**v:1804.02767, 2018.
[15]
Zheng Z, Wang P, Liu W, Distance-IoU loss: Faster and better learning for bounding box regression[C]//Proceedings of the AAAI conference on artificial intelligence. 2020, 34(07): 12993-13000.
[16]
Loshchilov I, Hutter F. Sgdr: Stochastic gradient descent with warm restarts[J]. ar**v preprint ar**v:1608.03983, 2016.
[17]
Yun S, Han D, Oh S J, Cutmix: Regularization strategy to train strong classifiers with localizable features[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019: 6023-6032.
[18]
Dai X. HybridNet: A fast vehicle detection system for autonomous driving[J]. Signal Processing: Image Communication, 2019, 70: 79-88.
[19]
Tan M, Pang R, Le Q V. Efficientdet: Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 10781-10790.

Index Terms

  1. Automatic driving multi task awareness network based on improved YOLOP
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ACAI '23: Proceedings of the 2023 6th International Conference on Algorithms, Computing and Artificial Intelligence
        December 2023
        371 pages
        ISBN:9798400709203
        DOI:10.1145/3639631
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 16 February 2024

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Autonomous driving
        2. HDM
        3. Multi-task network

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        ACAI 2023

        Acceptance Rates

        Overall Acceptance Rate 173 of 395 submissions, 44%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 35
          Total Downloads
        • Downloads (Last 12 months)34
        • Downloads (Last 6 weeks)5
        Reflects downloads up to 08 Mar 2025

        Other Metrics

        Citations

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media