Abstract
The bird’s eye view, multi-scale and dense classes in remote sensing images challenge the object detection of remote sensing images. It is not satisfactory to directly apply the object detection method designed for natural scene images to the object detection of remote sensing images. In this paper, we propose a detector with enhanced feature extraction ability to solve the above challenges, namely TWDFPN. TWDFPN has designed the structure of a two-way feature pyramid network (TWFPN) by combining feature maps with different generation directions and different spatial resolutions, which not only improves the utilization of the underlying feature information, but also strengthens the repeated utilization of the feature information of the backbone network, and ultimately improves the feature extraction ability of the network. Meanwhile, the dense-connected module is used in TWFPN to enhance the feature representation ability through limited additional computation cost, which extends the network and deepens the network. To evaluate the effectiveness of the proposed algorithm, this paper carried out experiments on NWPUVHR-10 and RSOD public remote sensing datasets, and the average accuracy (mAP) of 92.98% and 96.16%, respectively, which achieves advanced performance.
Similar content being viewed by others
References
Shivappriya SN, Priyadarsini MJP, Stateczny A et al (2021) Cascade object detection and remote sensing object detection method based on trainable activation function. Remote Sens 13(2):200
Li K, Wan G, Cheng G et al (2020) Object detection in optical remote sensing images: a survey and a new benchmark. ISPRS J Photogramm Remote Sens 159:296–307
Jiao L, Zhang F, Liu F et al (2019) A survey of deep learning-based object detection. IEEE Access 7:128837–128868
Sun P, Zheng Y, Zhou Z et al (2020) R4 Det: refined single-stage detector with feature recursion and refinement for rotating object detection in aerial images. Image Vis Comput 103:104036
Zou Z, Chen K, Shi Z, et al. (2023) Object detection in 20 years: a survey. In: Proceedings of the IEEE
Zhang M, Chen Y, Liu X et al (2020) Adaptive anchor networks for multi-scale object detection in remote sensing images. IEEE Access 8:57552–57565
Liu L, Ouyang W, Wang X et al (2020) Deep learning for generic object detection: a survey. Int J Comput Vis 128:261–318
Bochkovskiy A, Wang CY, Liao HYM (2020) Yolov4: optimal speed and accuracy of object detection. arXiv:2004.10934
Li Z, Wang Y, Zhang N et al (2022) Deep learning-based object detection techniques for remote sensing images: a survey. Remote Sens 14(10):2385
Lin TY, Doll´ar P, Girshick R, et al. (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision pattern recognition, 2117–2125
Liu S, Qi L, Qin H, et al. (2018) Path aggregation network for instance segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 8759–8768
Tan M, Pang R, Le QV (2020) Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 10781–10790
Ghaisi G, Lin TY, Pang R, et al. Learning scalable feature pyramid architecture for object detection. In: Proceedings of the IEEE computer vision and pattern recognition, 7029–7038
Zhang X, Zhu K, Chen G et al (2019) Geospatial object detection on high resolution remote sensing imagery based on double multi-scale feature pyramid network. Remote Sens 11(7):755
Fu K, Chen Z, Zhang Y et al (2019) Enhanced feature representation in detection for optical remote sensing images. Remote Sens 11(18):2095
Wang C, Bai X, Wang S et al (2018) Multiscale visual attention networks for object detection in VHR remote sensing images. IEEE Geosci Remote Sens Lett 16(2):310–314
Qu J, Su C, Zhang Z et al (2020) Dilated convolution and feature fusion SSD network for small object detection in remote sensing images. IEEE Access 8:82832–82843
Su H, Wei S, Liu S et al (2020) HQ-ISNet: high-quality instance segmentation for remote sensing imagery. Remote Sens 12(6):989
Lin Y, He H, Yin Z et al (2014) Rotation-invariant object detection in remote sensing images based on radial-gradient angle. IEEE Geosci Remote Sens Lett 12(4):746–750
Feng C, Cao Z, Xiao Y et al (2023) Multi-spectral template matching based object detection in a few-shot learning manner. Inf Sci 624:20–36
Ok AO (2013) Automated detection of buildings from single VHR multispectral images using shadow information and graph cuts. ISPRS J Photogramm Remote Sens 86:21–40
Lin Z, Zhu F, Kong Y et al (2022) SRSG and S2SG: a model and a dataset for scene graph generation of remote sensing images from segmentation results. IEEE Trans Geosci Remote Sens 60:1–11
Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens 65(1):2–16
Garajeh MK, Feizizadeh B, Blaschke T et al (2022) Detecting and mapping karst landforms using object-based image analysis: case study: Takht-Soleiman and Parava Mountains, Iran. Egypt J Remote Sens Space Sci 25(2):473–489
Li Y, Wang S, Tian Q et al (2015) Feature representation for statistical- learning-based object detection: a review. Pattern Recognit 48(11):3542–3559
Mahadevkar SV, Khemani B, Patil S, et al. (2022) A review on machine learning styles in computer vision-techniques and future directions. IEEE Access
Girshick R, Donahue J, Darrell T, et al. (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 580–587
Ren S, He K, Girshick R, et al. (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28
Lin TY, Goyal P, Girshick R, et al. (2017) Focal loss for dense object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 2980–2988
He K, Gkioxari G, Doll´ar P, et al. (2017) Mask r-cnn. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 2961–2969
Redmon J, Divvala S, Girshick R, et al. (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 779–788
Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. arXiv:1804.02767
Liu W, Anguelov D, Erhan D et al (2016) Ssd: single shot multibox detector. Springer, Berlin, pp 21–37
Zhang F, Du B, Zhang L et al (2016) Weakly supervised learning based on coupled convolutional neural networks for aircraft detection. IEEE Trans Geosci Remote Sens 54(9):5553–5563
Pang J, Li C, Shi J, et al. (2019) R2-CNN: fast tiny object detection in large- scale remote sensing images. arXiv:1902.06042
Li Y, Huang Q, Pei X et al (2020) RADet: refine feature pyramid network and multi-layer attention network for arbitrary-oriented object detection of remote sensing images. Remote Sens 12(3):389
Li C, Cong R, Guo C et al (2020) A parallel down-up fusion network for salient object detection in optical remote sensing images. Neurocomputing 415:411–420
Qiao S, Chen LC, Yuille A (2021) Detectors: detecting objects with recursive feature pyramid and switchable atrous convolution. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 10213–10224
He K, Zhang X, Ren S, et al. (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778
Huang G, Liu Z, Van Der Maaten L, et al. (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 4700–4708
Cheng G, Han J, Zhou P et al (2014) Multi-class geospatial object detection and geographic image classification based on collection of part detectors. ISPRS J Photogramm Remote Sens 98:119–132
Xiao Z, Liu Q, Tang G et al (2015) Elliptic Fourier transformation-based histograms of oriented gradients for rotationally invariant object detection in remote-sensing images. Int J Remote Sens 36(2):618–644
Ge Z, Liu S, Wang F, et al. (2021) Yolox: exceeding yolo series in 2021. arXiv:2107.08430
Zhang K, Shen H (2022) Multi-stage feature enhancement pyramid network for detecting objects in optical remote sensing images. Remote Sens 14(3):579
Acknowledgements
This work is partially supported by the Heilongjiang Provincial Natural Science Foundation (LH2022F047) and the Special Fund of Fundamental Scientific Research Business Expense for Higher School of Heilongjiang Province (2021-KYYWF-0002).
Author information
Authors and Affiliations
Contributions
HL contributed to methodology, software, investigation, writing—original draft, and data Curation. HM contributed to conceptualization, writing—review and editing, supervision, and data Curation. YC contributed to software ZY contributed to resources. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, H., Ma, H., Che, Y. et al. A two-way dense feature pyramid networks for object detection of remote sensing images. Knowl Inf Syst 65, 4847–4871 (2023). https://doi.org/10.1007/s10115-023-01916-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-023-01916-4