Abstract
Traffic sign detection is an essential part of traffic security and unmanned driving system. Due to the changes in the traffic environment is complex, how to intelligently and efficiently detect traffic signs in real scenes is of great significance. The traffic sign detection task is characterized by many small targets and complex environmental interference, and the detection scene also requires the detection model to be lightweight and efficient. This paper proposes a lightweight model Ghost-YOLO, and a lightweight module C3Ghost is designed to replace the feature extraction module in YOLOv5. C3Ghost modules extract features in a lightweight way, which effectively speeds up inference. At the same time, a new multi-scale feature extraction is designed to enhance the focus on small targets. Experimental results show that the mAP of the Ghost-YOLO is 92.71%, and the number of parameters and computations are respectively reduced to 91.4% and 50.29% of the original. Compared with multiple lightweight models, the speed and accuracy of this method are competitive.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author on reasonable request.
References
Arcos-García Á, Alvarez-Garcia JA, Soria-Morillo LM (2018) Evaluation of deep neural networks for traffic sign detection systems. Neurocomputing 316:332–344. https://doi.org/10.1016/j.neucom.2018.08.009
Belghaouti O, Handouzi W, Tabaa M (2020) Improved traffic sign recognition using deep ConvNet architecture. Procedia Comput Sci 177:468–473. https://doi.org/10.1016/j.procs.2020.10.064
Benallal M, Meunier J (2003) Real-time color segmentation of road signs. In CCECE 2003-Canadian conference on electrical and computer engineering. Toward a caring and humane technology (cat. No. 03CH37436) Vol. 3, pp 1823-1826. https://doi.org/10.1109/CCECE.2003.1226265
Bochkovskiy A, Wang CY, Liao HYM (2020) Yolov4: optimal speed and accuracy of object detection. https://doi.org/10.48550/arXiv.2004.10934
Dai J, Li Y, He K, Sun J (2016) R-fcn: Object detection via region-based fully convolutional networks. Advances in neural information processing systems, 29.
Ding Y, Ma Z, Wen S, Xie J et al (2021) AP-CNN: weakly supervised attention pyramid convolutional neural network for fine-grained visual classification. IEEE Trans Image Process 30:2826–2836. https://doi.org/10.1109/TIP.2021.3055617
Girshick R (2015) Fast RCNN, in 2015 IEEE International Conference on Computer Vision (ICCV) pp. 1440–1448
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580-1589
He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916. https://doi.org/10.1109/TPAMI.2015.2389824
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition pp. 770-778
He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770-778
Howard A G, Zhu M, Chen B, Kalenichenko D, Wang W et al (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. https://doi.org/10.48550/arXiv.1704.04861
Howard AG, Zhu M, Chen B et al (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Huang G, Liu Z, Van Der Maaten L, et al (2017) Densely connected convolutional networks. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708.
Iandola F N, Han S, Moskewicz MW, Ashraf K, Dally WJ, Keutzer K (2016) SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size. https://doi.org/10.48550/arXiv.1602.07360
Jaderberg M, Vedaldi A, Zisserman A (2014) Speeding up convolutional neural networks with low rank expansions. arXiv preprint arXiv:1405.3866. https://doi.org/10.48550/arXiv.1405.3866
Kayhan N, Fekri-Ershad S (2021) Content based image retrieval based on weighted fusion of texture and color features derived from modified local binary patterns and local neighborhood difference patterns. Multimed Tools Appl 80(21):32763–32790
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Proces Syst 25:1097–1105
Li J, Liang X, Wei Y et al (2017) Perceptual generative adversarial networks for small object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1222-1230. https://doi.org/10.48550/arXiv.1706.05274
Lin T Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980-2988
Lin TY et al (2017) Feature pyramid networks for object detection, 2017 IEEE conference on computer vision and pattern recognition (CVPR) IEEE computer society
Liu XF, Xiong F (2020) A real-time traffic sign detection model based on improved YOLOv3. IOP Conf Ser: Mater Sci Eng 787:012034
Liu W, Anguelov D, Erhan D, Szegedy C et al (2016) SSD: single shot multibox detector. In European conference on computer vision. Springer, Cham. pp. 21-37
Liu S, Qi L, Qin H, Shi J, Jia J (2018) Path aggregation network for instance segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8759-8768. 10.48550/
Liu Z, Du J, Tian F, Wen J (2019) MR-CNN: a multi-scale region-based convolutional neural network for small traffic sign recognition. IEEE Access 7:57120–57128. https://doi.org/10.1109/ACCESS.2019.2913882
Liu Z, Shen C, Qi M, Fan X (2020) SADANet: integrating scale-aware and domain adaptive for traffic sign detection. IEEE Access 8:77920–77933. https://doi.org/10.1109/ACCESS.2020.2989758
Liu Y, Lu B, Peng J, Zhang Z (2020) Research on the use of YOLOv5 object detection algorithm in mask wearing recognition. World Sci Res J 6(11):276–284. https://doi.org/10.6911/WSRJ.202011_6(11).0038
Liu Y, Peng J, Xue J-H et al (2021) TSingNet: scale-aware and context-rich feature learning for trafc sign detection and recognition in the wild. Neurocomputing 447:10–22. https://doi.org/10.1016/j.neucom.2021.03.049
Mei Y, et al (2020) Pyramid attention networks for image restoration
Ou Z, Xia F, Xiong B, Shi S et al (2019) FAMN: feature aggregation multipath network for small traffic sign detection. IEEE Access 7:178798–178810. https://doi.org/10.1109/ACCESS.2019.2959015
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779-788
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Proces Syst 28
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28
Romero A, Ballas N, Kahou S E, Chassang A, Gatta C, Bengio Y (2014) Fitnets: hints for thin deep nets. https://doi.org/10.48550/arXiv.1412.6550
Sermanet P, LeCun, Y (2011) Traffic sign recognition with multi-scale convolutional networks. In: The 2011 International Joint Conference on Neural Networks pp 2809–2813. https://doi.org/10.1109/IJCNN.2011.6033589
Sharma S, Kumar K (2021) ASL-3DCNN: American sign language recognition technique using 3-D convolutional neural networks. Multimed Tools Appl 80(17):26319–26331
Shuang Z, Yi Z, Lu X (2006) Intelligent Approach for Triangle Traffic Sign Detection. J Image Graph 01(08):1127–1131
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Song S, Que Z, Hou J, Du S, Song Y (2019) An efficient convolutional neural network for small traffic sign detection. J Syst Archit 97:269–277. https://doi.org/10.1016/j.sysarc.2019.01.012
Tang Q, Cao G, Jo KH (2021) Integrated feature pyramid network with feature aggregation for traffc sign detection. IEEE Access 9:117784–117794. https://doi.org/10.1109/access.2021.3106350
ultralytics. (n.d.) YOLOv5. Available online: https://github.com/ultralytics/YOLOv5
Wan J, Ding W, Zhu H, Xia M, Huang Z, Tian L, Zhu Y, Wang H (2020) An efcient small trafc sign detection method based on YOLOv3. J Signal Process Syst Signal Image Video Technol 93:899–911. https://doi.org/10.1007/s11265-020-01614-2
Wang W et al (2019) Efficient and accurate arbitrary-shaped text detection with pixel aggregation network, 2019 IEEE/CVF international conference on computer vision (ICCV) IEEE
Wang CY, Liao HYM, Wu YH, Chen PY, Hsieh JW, Yeh IH (2020) CSPNet: A new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 390–391
Wang C, Ning X, Sun L et al (2022) Learning discriminative features by covering local geometric space for point cloud analysis. IEEE Trans Geosci Remote Sens 60:1–15. https://doi.org/10.1109/TGRS.2022.3170493
Wu B, Iandola F, Jin P H, Keutzer K (2017) Squeezedet: unified, small, low power fully convolutional neural networks for real-time object detection for autonomous driving. In proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 129-137. https://doi.org/10.48550/arXiv.1612.01051
Xia Y, Xu W, Zhang L, Shi X, Mao K (2015) Integrating 3D structure into traffic scene understanding with RGB-D data. Neurocomputing 151:700–709. https://doi.org/10.1016/j.neucom.2014.05.091
Yang T, Long X, Sangaiah AK, Zheng Z, Tong C (2018) Deep detection network for real-life traffc sign in vehicular networks. Comput Netw 136:95–104. https://doi.org/10.1016/j.comnet.2018.02.026
Yuan X, Guo J, Hao X, Chen H (2015) Traffic sign detection via graph-based ranking and segmentation algorithms. IEEE Trans Syst Man Cybern Syst 45(12):1509–1521. https://doi.org/10.1109/TSMC.2015.2427771
Zhang Z, Huang K, Wang Y, Li M (2013) View independent object classification by exploring scene consistency information for traffic scene surveillance. Neurocomputing 99:250–260. https://doi.org/10.1016/j.neucom.2014.05.091
Zhang X, Zhou X, Lin M, Sun J (2018) Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 6848–6856. https://doi.org/10.48550/arXiv.1707.01083
Zhang J, Xie Z, Sun J, Zou X, Wang J (2020) A cascaded R-CNN with multiscale attention and imbalanced samples for traffic sign detection. IEEE Access 8:29742–29754. https://doi.org/10.1109/ACCESS.2020.2972338
Zhang H, Qin L, Li J, Guo Y, Xu Z (2020) Real-time detection method for small traffific signs based on Yolov3. IEEE Access 8:64145–64156
Zhou L, Deng, Z (2014) LIDAR and vision-based real-time traffic sign detection and recognition algorithm for intelligent vehicle. In: 17th international IEEE conference on intelligent transportation systems (ITSC), pp. 578–583. https://doi.org/10.1109/ITSC.2014.6957752
Zhou K, Zhan Y, Fu D (2021) Learning region-based attention network for traffic sign recognition. Sensors 21(3):686. https://doi.org/10.3390/s21030686
Zhu Y, Zhang C, Zhou D, Wang X, Bai X et al (2016) Traffic sign detection and recognition using fully convolutional network guided proposals. Neurocomputing 214:758–766. https://doi.org/10.1016/j.neucom.2016.07.009
Zhu Z, Liang D, Zhang S, Huang X, Li B, Hu S (2016) Traffic-sign detection and classification in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2110-2118
Acknowledgments
This work has been supported by the National Science Foundation of China, No.31870532.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, S., Che, S., Liu, Z. et al. A real-time and lightweight traffic sign detection method based on ghost-YOLO. Multimed Tools Appl 82, 26063–26087 (2023). https://doi.org/10.1007/s11042-023-14342-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14342-z