Concise feature pyramid region proposal network for multi-scale object detection

Fang, Baofu; Fang, Lu

doi:10.1007/s11227-018-2569-1

Concise feature pyramid region proposal network for multi-scale object detection

Published: 30 August 2018

Volume 76, pages 3327–3337, (2020)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Baofu Fang¹ &
Lu Fang¹

441 Accesses
9 Citations
Explore all metrics

Abstract

Object detection is a hot research issue in the field of computer vision. Many methods focus on detecting large objects. And features of small objects are easily weakened or even disappeared after multiple convolution layers. So the detection rate of multi-scale objects is unsatisfied. Aiming at this problem, a concise feature pyramid region proposal network (CFPRPN) is proposed to address the problem of small objects detection in this paper without missing the large objects. In the process of object detection, we propose a new method of adjustment for the object location. So the balanced detection of multi-scale objects is realized. CFPRPN combines image pyramids and feature pyramids. An image pyramid consists of scaled versions of an image and the feature pyramids produce multiple layers’ feature maps. They are both conducive to capturing the feature information of small objects in deep convolutional networks. At the same time, proposals of overlapping sizes from different layers are applied to improve the recall rate of multi-scale objects. These series operations are beneficial for CFPRPN to extract better proposals. We experimentally prove that after adding the fine-tuning location, the detection rate of multi-scale object is further improved. The inspiring thing is that refining location method is suitable for most algorithms of object detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection

Article 26 August 2022

Consistent scale normalization for object perception

Article 04 January 2021

Multi-level feature fusion pyramid network for object detection

Article 04 July 2022

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

Zhou Y, Han J, Yuan X, Wei Z, Hong R (2017) Inverse sparse group lasso model for robust object tracking. IEEE Trans Multimed 19(8):1798–1810
Article Google Scholar
Wang H, Fan Y, Fang B (2018) Generalized linear discriminant analysis based on Euclidean norm for gait recognition. Int J Mach Learn Cybernet 9(4):569–576
Article Google Scholar
Ommer B, Malik J (2009) Multi-scale object detection by clustering lines. In: IEEE International Conference on Computer Vision, pp 484–491
Uijlings JR, Sande KE, Gevers T, Smeulders AW (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171
Article Google Scholar
Zitnick CL, Doll´ar P (2014) Edge boxes: Locating object proposals from edges. In European Conference on Computer Vision, pp 391–405
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp 580–587
Girshick R (2015) Fast R-CNN. In: IEEE International Conference on Computer Vision, pp 1440–1448
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In International Conference on Neural Information Processing Systems, pp 91–99
Shrivastava A, Gupta A, Girshick R (2016) Training region-based object detectors with online hard example mining. In IEEE International Conference on Computer Vision and Pattern Recognition, pp 761–769
Gkioxari G, Girshick R, Malik J (2015) Contextual action recognition with R-CNN. In: IEEE International Conference on Computer Vision, pp 1080–1088
Yuan X, Xie L, Abouelenien M (2018) A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data. Pattern Recognit 77:160–172
Article Google Scholar
Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, p 4
Zeiler MD, Krishnan D, Taylor GW, Fergus R (2010) Deconvolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2528–2535
Kantorov V, Oquab M, Cho, Laptev I (2016) ContextLocNet: context-aware deep network models for weakly supervised localization. In: European Conference on Computer Vision, pp 350–365
Everingham M, Zisserman A, Williams CK et al The PASCAL visual object classes challenge 2007 (VOC2007) results. In: International Conference on Machine Learning Challenges: Evaluating Predictive Uncertainty Visual Object Classification and Recognizing Textual Entailment. Springer, Berlin, pp 117–176
Deng J, Dong W, Socher R, Li LJ, Li K, Li FF (2009) ImageNet: a large-scale hierarchical image database. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp 248–255

Download references

Acknowledgements

This work is supported by the Natural Science Foundation of Anhui Province (1708085MF146), Science and Technology Support Project of Sichuan Province (2016GZ0389), Project of Innovation Team of Ministry of Education of China (IRT17R32) and the Fundamental Research Funds for the Central Universities (No. PA2018GDQT0011).

Author information

Authors and Affiliations

School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, 230601, Anhui, China
Baofu Fang & Lu Fang

Authors

Baofu Fang
View author publications
Search author on:PubMed Google Scholar
Lu Fang
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Baofu Fang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fang, B., Fang, L. Concise feature pyramid region proposal network for multi-scale object detection. J Supercomput 76, 3327–3337 (2020). https://doi.org/10.1007/s11227-018-2569-1

Download citation

Published: 30 August 2018
Issue Date: May 2020
DOI: https://doi.org/10.1007/s11227-018-2569-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Concise feature pyramid region proposal network for multi-scale object detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection

Consistent scale normalization for object perception

Multi-level feature fusion pyramid network for object detection

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now