An object detection algorithm based on the feature pyramid network and single shot multibox detector

Wang, Yanni; Liu, Xiang; Guo, Rongchun

doi:10.1007/s10586-022-03560-z

An object detection algorithm based on the feature pyramid network and single shot multibox detector

Published: 01 March 2022

Volume 25, pages 3313–3324, (2022)
Cite this article

Cluster Computing Aims and scope Submit manuscript

375 Accesses
2 Citations
Explore all metrics

Abstract

In order to solve the problem of weak detection of small targets in traditional methods, an improved object detection algorithm is proposed. First, the six multi-scale feature maps extracted from the original SSD algorithm are fused in turn to form a new feature map with detailed information and semantic information based on the feature pyramid network and the idea of single shot multibox detector algorithm. Then, the attention model is added to the fused feature map, and the feature information of small targets can be extracted effectively. With PASCAL VOC2007 and VOC2012 as the training set, the mean average precision tested in the VOC2007 test set reached 78.3%, which is 1.1% higher than the original algorithms. In different environments, the algorithm has accurate detection effect on densely distributed small objects, and the missed detection and robustness are better than other algorithms. At the same time, the detection speed can still meet the real-time requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

Multi-scale Object Detection Algorithm Based on Adaptive Feature Fusion

An enhanced SSD with feature fusion and visual reasoning for object detection

Article 19 April 2018

Cross-scale information enhancement for object detection

Article 04 March 2024

Data availability

The data that supports the findings of this study are available within the article and the code that support the findings of this study are available from the corresponding author upon reasonable request.

References

Pan, P., Schonfeld, D.: Video tracking based on sequential particle filtering on graphs. IEEE Trans. Image Process. 20(6), 1641–1651 (2011)
Article MathSciNet Google Scholar
Huang, K.Q., Chen, X.T., Kang, Y.F., et al.: Intelligent visual surveillance: a review. Chin. J. Comput. 20(3), 1093–1118 (2015)
MathSciNet Google Scholar
Qiaorong, Z., Xinyang, F.: Object tracking based on visual saliency and particle filter. J. Image Graph. 18(5), 515–522 (2013)
Google Scholar
Yang, J., Chen, L.N., Chen, Y.S., et al.: Target detection and recognition based on depth learning. Inf. Technol. 42(10), 89–95 (2018)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant key points. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Viola, P., Jones, M.J.: Robust real-time face detection. Int. J. Comput. Vision 57(2), 137–154 (2004)
Article Google Scholar
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, OH, USA: IEEE. pp. 580–587 (2014)
Girshick, R.: Fast R-CNN. In: Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE. pp. 1440–1448 (2015)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2015)
Article Google Scholar
Lin, C.F., Wang, S.D.: Fuzzy support vector machines. IEEE Trans. Neural Netw. 13(2), 464–471 (2002)
Article Google Scholar
Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las-Vegas, NV, USA: IEEE, 2016:779–788.
Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot multibox detector. In: Proceedings of the 14th European Conference on Computer Vision, pp. 21–37. Springer, Amsterdam (2016)
Google Scholar
Lin, T. Y., Dollar, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Honolulu. pp. 936–944 (2017)
Huang, C.B., Liu, Q., Yu, S.S.: Region of interest extraction from color image based on visual saliency. J. Supercomput. 58(1), 20–33 (2011)
Article Google Scholar
Jun-hong, X.U., Chun-feng, D.I.N.G., Hai-bin, S.U., et al.: Moving object extraction algorithm based on improved GVF. Comput. Eng. 38(9), 199–201 (2012)
Google Scholar
Lee, J.H., Jang, T.J., Lee, I., et al.: Optimization and estimation of parameters for a compton camera consisting of the DSSD scatterer and the GAGG absorber with the monte carlo simulation. J. Korean Phys. Soc. 77(12), 1113–1117 (2020)
Article Google Scholar

Download references

Acknowledgements

We thank the anonymous reviewers for their constructive comments. This work is supported by the National Natural Science Foundation of China (No. 61803294) and the Natural Science Foundation of Shaanxi Province, China (No. 2020JM-499, No. 2020JQ-684).

Funding

Funding was provided by the National Natural Science Foundation, China (Grant No. 61803294) and the Natural Science Foundation of Shaanxi Province, China (Grant No. 2020JM-499, No. 2020JQ-684).

Author information

Authors and Affiliations

School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an, 710055, Shaanxi, China
Yanni Wang & Xiang Liu
School of Digital Arts, Xi’an University of Posts and Telecommunications, Xi’an, 710121, Shaanxi, China
Rongchun Guo

Authors

Yanni Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Rongchun Guo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

YW proposed the idea of this algorithm and revised the manuscript. XL drafted the first edition of this paper by analyzing new feature map with detailed information and semantic information based on the feature pyramid network and single shot multibox detector algorithm. RG conducted parts of the simulation of experiments.

Corresponding author

Correspondence to Yanni Wang.

Ethics declarations

Conflict of interest

There are no potential competing interests in our paper. All authors have seen the manuscript and approved its submission to your journal. We confirm that the contents of the manuscript have not been published or submitted for publication elsewhere.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, Y., Liu, X. & Guo, R. An object detection algorithm based on the feature pyramid network and single shot multibox detector. Cluster Comput 25, 3313–3324 (2022). https://doi.org/10.1007/s10586-022-03560-z

Download citation

Received: 13 September 2021
Revised: 26 November 2021
Accepted: 02 February 2022
Published: 01 March 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s10586-022-03560-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An object detection algorithm based on the feature pyramid network and single shot multibox detector

Abstract

Access this article

Similar content being viewed by others

Multi-scale Object Detection Algorithm Based on Adaptive Feature Fusion

An enhanced SSD with feature fusion and visual reasoning for object detection

Cross-scale information enhancement for object detection

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An object detection algorithm based on the feature pyramid network and single shot multibox detector

Abstract

Access this article

Similar content being viewed by others

Multi-scale Object Detection Algorithm Based on Adaptive Feature Fusion

An enhanced SSD with feature fusion and visual reasoning for object detection

Cross-scale information enhancement for object detection

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation