B-FPN SSD: an SSD algorithm based on a bidirectional feature fusion pyramid

Liu, Qunpo; Bi, Junjia; Zhang, Jingwen; Bu, Xuhui; Hanajima, Naohiko

doi:10.1007/s00371-022-02727-4

Original article
Published: 04 December 2022

The Visual Computer, volume 39, pages 6265–6277 (2023)

Qunpo Liu^1,2 (ORCID: orcid.org/0000-0002-2157-0278),
Junjia Bi^1,
Jingwen Zhang^1,
Xuhui Bu^1,2 &
Naohiko Hanajima^2,3

Abstract

This paper proposes B-FPN SSD, a Single Shot MultiBox Detector (SSD) algorithm built on a bidirectional feature fusion pyramid (B-FPN). First, a B-FPN structure is constructed that fuses the feature layers in both directions, improving detection accuracy. Second, coordinate attention (CA) is introduced so that the network focuses on important channel features while preserving their location information. Third, the loss function is optimized, which speeds up model convergence and further improves detection accuracy. Experimental results show that on the VOC2007 dataset the proposed algorithm reaches an mAP of 76.48%, which is 3.52% higher than that of the SSD algorithm; on the COCO 2017 dataset its mAP is 3.85% higher than that of SSD. Compared with other mainstream object detection algorithms, the proposed algorithm has an advantage in detection accuracy while maintaining satisfactory detection speed. Finally, in the special environment of iron ore transportation, it recognizes foreign objects with an accuracy of 98.26%.
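
To make the idea of bidirectional fusion concrete, the following is a minimal sketch of a generic top-down pass followed by a bottom-up pass over a feature pyramid. It is not the authors' B-FPN: the abstract does not specify the layer connections, so the single-channel maps, nearest-neighbour upsampling, 2x2 max pooling, element-wise addition, and all function names (`upsample2x`, `downsample2x`, `add_maps`, `bidirectional_fuse`) are assumptions chosen purely for illustration.

```python
from typing import List

# Illustrative assumption: one single-channel H x W map per pyramid level.
FeatureMap = List[List[float]]


def upsample2x(fm: FeatureMap) -> FeatureMap:
    """Nearest-neighbour upsampling by a factor of 2 in both dimensions."""
    out: FeatureMap = []
    for row in fm:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))  # repeat each row twice
    return out


def downsample2x(fm: FeatureMap) -> FeatureMap:
    """2x2 max pooling with stride 2 (assumes even height and width)."""
    return [
        [max(fm[i][j], fm[i][j + 1], fm[i + 1][j], fm[i + 1][j + 1])
         for j in range(0, len(fm[0]), 2)]
        for i in range(0, len(fm), 2)
    ]


def add_maps(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Element-wise sum of two maps of identical shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def bidirectional_fuse(levels: List[FeatureMap]) -> List[FeatureMap]:
    """Fuse a pyramid in both directions.

    levels[0] is the highest-resolution map; each following level is
    assumed to be exactly half the height and width of the previous one.
    """
    n = len(levels)

    # Top-down pass: carry coarse, semantic information to finer levels.
    top_down: List[FeatureMap] = [[] for _ in range(n)]
    top_down[-1] = levels[-1]
    for i in range(n - 2, -1, -1):
        top_down[i] = add_maps(levels[i], upsample2x(top_down[i + 1]))

    # Bottom-up pass: carry fine, localisation detail back to coarser levels.
    fused: List[FeatureMap] = [[] for _ in range(n)]
    fused[0] = top_down[0]
    for i in range(1, n):
        fused[i] = add_maps(top_down[i], downsample2x(fused[i - 1]))
    return fused
```

Calling `bidirectional_fuse` on a three-level pyramid of sizes 4x4, 2x2, and 1x1 returns three maps of the same sizes, each of which now mixes information from every other level. A real detector would replace the plain additions with learned convolutions over multi-channel tensors.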

Data availability statement

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Acknowledgements

This work is partially supported by the National Natural Science Foundation of China (No. U1804147), the Innovative Scientists and Technicians Team of Henan Provincial High Education (20IRTSTHN019), and the Science and Technology Project of Henan Province (No. 212102210508).

Author information

Authors and Affiliations

School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, China
Qunpo Liu, Junjia Bi, Jingwen Zhang & Xuhui Bu
Henan International Joint Laboratory of Direct Drive and Control of Intelligent Equipment, Jiaozuo, Henan, China
Qunpo Liu, Xuhui Bu & Naohiko Hanajima
College of Information and Systems, Muroran Institute of Technology, 27-1 Mizumoto-Cho, Muroran, Hokkaido, 050-8585, Japan
Naohiko Hanajima

Corresponding author

Correspondence to Junjia Bi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Liu, Q., Bi, J., Zhang, J. et al. B-FPN SSD: an SSD algorithm based on a bidirectional feature fusion pyramid. Vis Comput 39, 6265–6277 (2023). https://doi.org/10.1007/s00371-022-02727-4

Accepted: 08 November 2022
Published: 04 December 2022
Issue Date: December 2023
DOI: https://doi.org/10.1007/s00371-022-02727-4
