Abstract
The multiscale feature fusion strategy has made substantial strides in object detection; however, it may result in feature loss or boundary distortion for larger objects when optimized for small object detection. In order to capture additional contextual information, this paper suggests a novel feature pyramid composite network structure that improves object feature extraction by incorporating residual feature extraction. In order to mitigate feature loss during the fusion process, a unified module is implemented to collect and combine global data. Furthermore, in order to optimize inter-layer information flow and mitigate interference from conflicting information, feature refinement is implemented in both channel and spatial dimensions. This method enhances the detection performance of small objects and maintains the critical features of larger objects. Experimental results indicate that this method surpasses other object detection models in terms of detection precision and achieves a 2.3% higher detection accuracy on the KITTI dataset than YOLOv9.










Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Data availability
No datasets were generated or analysed during the current study.
References
Zaidi SSA, Ansari MS, Aslam A, Kanwal N, Asghar M, Lee B (2022) A survey of modern deep learning based object detection models. Digit Signal Process 126:103514
Zou Z, Chen K, Shi Z, Guo Y, Ye J (2023) Object detection in 20 years: a survey. Proc IEEE 111(3):257–276
Jiao L, Wang M, Liu X, Li L, Liu F, Feng Z, Yang S, Hou B (2024) Multiscale deep learning for detection and recognition: a comprehensive survey. IEEE Trans Neural Netw Learn Syst
Xiao J, Guo H, Zhou J, Zhao T, Yu Q, Chen Y, Wang Z (2023) Tiny object detection with context enhancement and feature purification. Expert Syst Appl 211:118665
Chen S, Zhang Z, Zhang L, He R, Li Z, Xu M, Ma H (2024) A semi-supervised learning framework combining cnn and multi-scale transformer for traffic sign detection and recognition. IEEE Internet Things J
Gao X, Chen L, Wang K, Xiong X, Wang H, Li Y (2022) Improved traffic sign detection algorithm based on faster r-cnn. Appl Sci 12(18):8948
Liu Y, Sun P, Wergeles N, Shang Y (2021) A survey and performance evaluation of deep learning methods for small object detection. Expert Syst Appl 172:114602
Chen J, Wang Q, Peng W, Xu H, Li X, Xu W (2022) Disparity-based multiscale fusion network for transportation detection. IEEE Trans Intell Transp Syst 23(10):18855–18863
Lian J, Yin Y, Li L, Wang Z, Zhou Y (2021) Small object detection in traffic scenes based on attention feature fusion. Sensors 21(9):3031
Ge Z (2021) Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430
Terven J, Córdova-Esparza D-M, Romero-González J-A (2023) A comprehensive review of yolo architectures in computer vision: from yolov1 to yolov8 and yolo-nas. Mach Learn Knowl Extr 5(4):1680–1716
Luo Y, Zhang C, Lin W, Yang X, Sun J (2022) Sequential attention-based distinct part modeling for balanced pedestrian detection. IEEE Trans Intell Transp Syst 23(9):15644–15654
Zhou H, Yu G (2021) Research on pedestrian detection technology based on the svm classifier trained by hog and ltp features. Futur Gener Comput Syst 125:604–615
Yucong S, Shuqing G (2021) Traffic sign recognition based on hog feature extraction. J Meas Eng 9(3):142–155
Ren X, Wang R (2023) A target detection algorithm in traffic scenes based on deep reinforcement learning. arXiv preprint arXiv:2312.15606
Shen L, You L, Peng B, Zhang C (2021) Group multi-scale attention pyramid network for traffic sign detection. Neurocomputing 452:1–14
Guo Q, Zhang J, Zhu S, Zhong C, Zhang Y (2021) Deep multiscale siamese network with parallel convolutional structure and self-attention for change detection. IEEE Trans Geosci Remote Sens 60:1–12
Li S, Yan Q, Liu P (2020) An efficient fire detection method based on multiscale feature extraction, implicit deep supervision and channel attention mechanism. IEEE Trans Image Process 29:8467–8475
Mahaur B, Mishra K (2023) Small-object detection based on yolov5 in autonomous driving systems. Pattern Recognit Lett 168:115–122
Wei H, Zhang Q, Qian Y, Xu Z, Han J (2023) Mtsdet: multi-scale traffic sign detection with attention and path aggregation. Appl Intell 53(1):238–250
Xia J, Li M, Liu W, Chen X (2023) Dsra-detr: an improved detr for multiscale traffic sign detection. Sustainability 15(14):10862
Wang C, He W, Nie Y, Guo J, Liu C, Wang Y, Han K (2024) Gold-yolo: Efficient object detector via gather-and-distribute mechanism. Advances in Neural Information Processing Systems, vol 36
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 7132–7141
Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3146–3154
Zhang J, Li X, Li J, Liu L, Xue Z, Zhang B, Jiang Z, Huang T, Wang Y, Wang C (2023) Rethinking mobile block for efficient attention-based models. In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society, pp 1389–1400
Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The Kitti vision benchmark suite. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 3354–3361
Zhang J, Zou X, Kuang L-D, Wang J, Sherratt RS, Yu X (2022) Cctsdb 2021: a more comprehensive traffic sign detection benchmark. Human-centric Computing and Information Sciences, vol 12
Author information
Authors and Affiliations
Contributions
Tao Liu: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Writing—original draft. Wendong Zhang: Resources, Supervision, Chenyoukang Lin:data curation Yunteng Hu: data curation Ruyi Cao: Formal analysis.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, T., Lin, C., Hu, Y. et al. Traffic target detection based on context enhancement and feature purification. J Supercomput 81, 413 (2025). https://doi.org/10.1007/s11227-025-06944-1
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-025-06944-1