One-Stage Deep Channels Attention Network for Remote Sensing Images Object Detection

Tang, Jinyun; Zhang, Wenzhen; Zhang, Guixian; Liang, Rongjiao; Lu, Guangquan

doi:10.1007/978-3-031-25198-6_36

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13422)

Included in the following conference series:

Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data

Abstract

Although deep-learning-based object detection methods for remote sensing images have made significant progress, they do not fully address the feature loss caused by the differing importance of feature-map channels during convolution and pooling. Therefore, a one-stage deep channels attention network for object detection in remote sensing images is proposed. First, through the multi-scale feature representation of the Single Shot MultiBox Detector (SSD) network, the model combines semantic information with detailed features and better integrates feature layers of different resolutions. Second, a squeeze-and-excitation (SE) module is introduced into each additional feature extraction layer; it adaptively recalibrates channel-wise responses by modeling the interdependencies between deep channels, so that the network learns more effective feature information. Experimental results on the RSOD and NWPU VHR-10 datasets show that the proposed models achieve state-of-the-art performance.
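The channel recalibration described in the abstract can be illustrated with a minimal sketch of a generic squeeze-and-excitation step in plain Python. This is not the authors' implementation: the tensor sizes, the reduction ratio, the random weights, and all function names (`squeeze`, `excite`, `se_block`, `random_weights`) are assumptions made for illustration, and bias terms are omitted for brevity.

```python
import math
import random


def squeeze(feature_map):
    """Global average pooling: C x H x W nested lists -> one scalar per channel."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def matvec(weights, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in weights]


def excite(z, w_reduce, w_expand):
    """Bottleneck of two linear maps: ReLU after the first, sigmoid after the second."""
    hidden = [max(0.0, h) for h in matvec(w_reduce, z)]
    return [1.0 / (1.0 + math.exp(-g)) for g in matvec(w_expand, hidden)]


def se_block(feature_map, w_reduce, w_expand):
    """Rescale every channel by its learned gate in (0, 1)."""
    gates = excite(squeeze(feature_map), w_reduce, w_expand)
    return [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]


def random_weights(rows, cols, rng):
    """Hypothetical stand-in for trained weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Toy sizes (assumed): 4 channels, 3 x 3 spatial grid, reduction ratio 2.
rng = random.Random(0)
channels, height, width, reduction = 4, 3, 3, 2
features = [
    [[rng.random() for _ in range(width)] for _ in range(height)]
    for _ in range(channels)
]
w_reduce = random_weights(channels // reduction, channels, rng)
w_expand = random_weights(channels, channels // reduction, rng)

recalibrated = se_block(features, w_reduce, w_expand)
```

The output keeps the input shape; only the per-channel scale changes, which is what lets a block of this kind be attached to an existing feature extraction layer without altering the layers that follow it.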

Acknowledgements

This work is partially supported by the Project of Guangxi Science and Technology (GuiKeAD20159041); the Research Fund of Guangxi Key Lab of Multi-source Information Mining & Security (No. 20-A-01–01, No. 20-A-01–02, MIMS21-M-01, MIMS20-M-01, MIMS20-04); the Innovation Project of Guangxi Graduate Education (YCSW2022124); the Guangxi “Bagui” Teams for Innovation and Research, China; and the Guangxi Collaborative Innovation Center of Multi-Source Information Integration and Intelligent Processing.

Author information

Authors and Affiliations

Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China
Jinyun Tang, Wenzhen Zhang, Guixian Zhang, Rongjiao Liang & Guangquan Lu

Corresponding author

Correspondence to Guangquan Lu.

Editor information

Editors and Affiliations

Nanjing University of Aeronautics and Astronautics, Nanjing, China
Bohan Li
Newcastle University, Callaghan, NSW, Australia
Lin Yue
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Chuanqi Tao
Jinan University, Guangzhou, China
Xuming Han
Free University of Bozen-Bolzano, Bolzano, Italy
Diego Calvanese
University of Tsukuba, Tsukuba, Japan
Toshiyuki Amagasa

About this paper

Cite this paper

Tang, J., Zhang, W., Zhang, G., Liang, R., Lu, G. (2023). One-Stage Deep Channels Attention Network for Remote Sensing Images Object Detection. In: Li, B., Yue, L., Tao, C., Han, X., Calvanese, D., Amagasa, T. (eds) Web and Big Data. APWeb-WAIM 2022. Lecture Notes in Computer Science, vol 13422. Springer, Cham. https://doi.org/10.1007/978-3-031-25198-6_36

DOI: https://doi.org/10.1007/978-3-031-25198-6_36
Published: 10 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25197-9
Online ISBN: 978-3-031-25198-6
eBook Packages: Computer Science, Computer Science (R0)
