Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Zhang, Yongliang; Lu, Yang; Zhu, Wuqiang; Wei, Xing; Wei, Zhen

doi:10.1007/s11227-022-04670-6

Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Published: 06 August 2022

Volume 79, pages 2137–2152, (2023)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Yongliang Zhang¹,
Yang Lu^1,2,3,
Wuqiang Zhu¹,
Xing Wei^1,3,4 &
…
Zhen Wei ORCID: orcid.org/0000-0002-3818-4680^1,2,3

1119 Accesses
6 Citations
Explore all metrics

Abstract

Existing algorithms have difficulty in solving the two tasks of localization and classification simultaneously when performing traffic sign detection on realistic images of complex traffic scenes. In order to solve the above problems, a new road traffic sign dataset is created, and based on the YOLOv4 algorithm, for the complexity of realistic traffic scene images and the large variation in the size of traffic signs in the images, the multi-scale feature extraction module, cascade feature fusion module and attention mechanism module are designed to improve the algorithm’s ability to locate and classify traffic signs simultaneously. Experimental results on the newly created dataset show that the improved algorithm achieves a mean average precision of 84.44%, which is higher than several major CNN-based object detection algorithms for the same type of task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MTSDet: multi-scale traffic sign detection with attention and path aggregation

Article 14 April 2022

Traffic Sign Detection Based on Camera Imaging Apriority

YOLOF-F: you only look one-level feature fusion for traffic sign detection

Article 17 March 2023

Data Availability

All data generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Saponara S (2013) Real-time color/shape-based traffic signs acquisition and recognition system. In: Real-time image and video processing 2013, Burlingame, CA, USA, Feb 6-7, vol 8656, p 86560
Khan JF, Bhuiyan SMA, Adhami RR (2011) Image segmentation and shape analysis for road-sign detection. IEEE Trans Intell Transp Syst 12(1):83–96
Article Google Scholar
Liang M, Yuan M, Hu X, Li J, Liu H (2013) Traffic sign detection by ROI extraction and histogram features-based recognition. In: The 2013 International Joint Conference on Neural Networks, IJCNN, 2013 Dallas, TX, USA, Aug 4-9, pp 1–8
Fleyeh H, Biswas R, Davami E (2013) Traffic sign detection based on adaboost color segmentation and svm classification. In: Eurocon, pp 2005–2010
Maldonado-Bascon S, Lafuente-Arroyo S, Gil-Jimenez P, Gomez-Moreno H, Lopez-Ferreras F (2007) Road-sign detection and recognition based on support vector machines. IEEE Trans Intell Transp Syst 8(2):264–278
Article MATH Google Scholar
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp 580–587
Girshick RB (2015) Fast R-CNN. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, Dec 7-13, pp 1440–1448
Ren S, He K, Girshick R, Sun J (2017) Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Article Google Scholar
Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu C, Berg AC (2016) SSD: single shot multibox detector. In: Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part I, vol 9905, pp 21–37
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 779–788
Qiao K, Gu H, Liu J, Liu P (2017) Optimization of traffic sign detection and classification based on faster r-cnn. In: 2017 International Conference on Computer Technology, Electronics and Communication (ICCTEC), pp 608–611
Li J, Wang Z (2019) Real-time traffic sign recognition based on efficient cnns in the wild. IEEE Trans Intell Transp Syst 20(3):975–984
Article Google Scholar
Rajendran SP, Shine L, Pradeep R, Vijayaraghavan S (2019) Real-time traffic sign recognition using yolov3 based detector. In: 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp 1–7
Everingham M, Gool LV, Williams CKI, Winn JM, Zisserman A (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338
Article Google Scholar
Lin T, Maire M, Belongie SJ, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: common objects in context. In: Computer Vision - ECCV 2014 - 13th European Conference, Zurich, Switzerland, September 6-12, Proceedings, Part V, vol 8693, pp 740–755
Liu Z, Du J, Tian F, Wen J (2019) Mr-cnn: a multi-scale region-based convolutional neural network for small traffic sign recognition. IEEE Access 7:57120–57128
Article Google Scholar
Tabernik D, Skočaj D (2020) Deep learning for large-scale traffic-sign detection and recognition. IEEE Trans Intell Transp Syst 21(4):1427–1440
Article Google Scholar
Lee HS, Kim K (2018) Simultaneous traffic sign detection and boundary estimation using convolutional neural network. IEEE Trans Intell Transp Syst 19(5):1652–1663
Article Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science
Szegedy C, Liu W, Jia Y, Sermanet P, Reed SE, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7-12, pp 1–9
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, pp 770–778
Stallkamp J, Schlipsing M, Salmen J, Igel C (2011) The german traffic sign recognition benchmark: a multi-class classification competition. In: International Joint Conference on Neural Networks
Stallkamp J, Schlipsing M, Salmen J et al (2012) Man vs. computer: benchmarking machine learning algorithms for traffic sign recognition. Neural Netw 32:323–332
Article Google Scholar
Uijlings JRR, Sande KEAVD, Gevers T, Smeulders AWM (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171
Article Google Scholar
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. CoRR: https://arxiv.org/abs/1804.02767
Shelhamer E, Long J, Darrell T (2017) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39(4):640–651
Article Google Scholar
Lin T, Dollár P, Girshick RB, He K, Hariharan B, Belongie SJ (2017) Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21-26, pp 936–944
Bochkovskiy A, Wang C, Liao HM (2020) Yolov4: optimal speed and accuracy of object detection. CoRR https://arxiv.org/abs/2004.10934
Liu S, Qi L, Qin H, Shi J, Jia J (2018) Path aggregation network for instance segmentation. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, pp 8759–8768
Zhu Z, Liang D, Zhang S, Huang X, Li B, Hu S (2016) Traffic-sign detection and classification in the wild. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, pp 2110–2118
Zhang J, Huang M, Jin X, Li X (2017) A real-time chinese traffic sign detection algorithm based on modified yolov2. Algorithms 10(4):127
Article MathSciNet MATH Google Scholar
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, Feb 4-9, San Francisco, California, USA, pp 4278–4284
Hu J, Shen L, Albanie S, Sun G, Wu E (2020) Squeeze-and-excitation networks. IEEE Trans Pattern Anal Mach Intell 42(8):2011–2023
Article Google Scholar
Misra D (2019) Mish: a self regularized non-monotonic neural activation function. CoRR https://arxiv.org/abs/1908.08681
Liu F, Qian Y, Li H, Wang Y, Zhang H (2021) Caffnet: channel attention and feature fusion network for multi-target traffic sign detection. Int J Pattern Recognit Artif Intell 35(7):2152008–1215200820
Article Google Scholar
Ren K, Huang L, Fan C, Han H, Deng H (2021) Real-time traffic sign detection network using ds-detnet and lite fusion FPN. J Real Time Image Process 18(6):2181–2191
Article Google Scholar
Serna CG, Ruichek Y (2020) Traffic signs detection and classification for european urban environments. IEEE Trans Intell Transp Syst 21(10):4388–4399
Article Google Scholar

Download references

Acknowledgements

This work is supported in part by the Anhui Provincial Key R &D Program of China under Grant 202004a05020040, in part by the Intelligent Network and New Energy Vehicle Special Project of Intelligent Manufacturing Institute of HFUT under Grant IMIWL2019003, in part by the National Key Research and Development Program of China under Grant 2018YFC0604404, and in part by the Fundamental Research Funds for the Central Universities under Grant PA2021GDGP0061.

Author information

Authors and Affiliations

School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, 230009, China
Yongliang Zhang, Yang Lu, Wuqiang Zhu, Xing Wei & Zhen Wei
Anhui Mine IOT and Security Monitoring Technology Key Laboratory, Hefei, 230088, China
Yang Lu & Zhen Wei
Engineering Research Center of Safety Critical Industrial Measurement and Control Technology, Ministry of Education, Hefei University of Technology, Hefei, 230009, China
Yang Lu, Xing Wei & Zhen Wei
Intelligent Manufacturing Institute of HeFei University of Technology, Hefei, 230009, China
Xing Wei

Authors

Yongliang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Wuqiang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Xing Wei
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Wei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhen Wei.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, Y., Lu, Y., Zhu, W. et al. Traffic sign detection based on multi-scale feature extraction and cascade feature fusion. J Supercomput 79, 2137–2152 (2023). https://doi.org/10.1007/s11227-022-04670-6

Download citation

Accepted: 13 June 2022
Published: 06 August 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s11227-022-04670-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Abstract

Access this article

Similar content being viewed by others

MTSDet: multi-scale traffic sign detection with attention and path aggregation

Traffic Sign Detection Based on Camera Imaging Apriority

YOLOF-F: you only look one-level feature fusion for traffic sign detection

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Abstract

Access this article

Similar content being viewed by others

MTSDet: multi-scale traffic sign detection with attention and path aggregation

Traffic Sign Detection Based on Camera Imaging Apriority

YOLOF-F: you only look one-level feature fusion for traffic sign detection

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation