Abstract
Although deep learning has brought significant progress to object detection in remote sensing images, existing methods do not fully account for the feature loss that occurs during convolution and pooling because the channels of a feature map differ in importance. Therefore, a one-stage deep channels attention network for object detection in remote sensing images is proposed. First, using the multi-scale feature representation of the Single Shot MultiBox Detector (SSD) network, the model combines semantic information with detailed features to better integrate feature layers of different resolutions. Second, a squeeze-and-excitation (SE) module is introduced for each additional feature extraction layer; it adaptively recalibrates channel-wise feature responses by modeling the interdependencies between deep channels, so that the network learns more effective feature information. Experimental results on the RSOD and NWPU VHR-10 datasets show that the proposed models achieve state-of-the-art performance.
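The channel recalibration described in the abstract can be illustrated with a minimal, framework-free sketch of a generic squeeze-and-excitation block. This is not the authors' implementation: the function names, the tensor sizes, the reduction ratio of 2, and the random weights (which stand in for learned parameters) are all assumptions chosen for illustration.

```python
import math
import random


def squeeze(feature_map):
    """Global average pooling: one scalar per channel (C x H x W -> C)."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def dense(vector, weights, biases):
    """Fully connected layer; weights has shape (out, in)."""
    return [
        sum(w * v for w, v in zip(row, vector)) + b
        for row, b in zip(weights, biases)
    ]


def excite(descriptor, w1, b1, w2, b2):
    """Bottleneck MLP: FC -> ReLU -> FC -> sigmoid, one gate in (0, 1) per channel."""
    hidden = [max(0.0, h) for h in dense(descriptor, w1, b1)]
    return [1.0 / (1.0 + math.exp(-z)) for z in dense(hidden, w2, b2)]


def se_block(feature_map, w1, b1, w2, b2):
    """Rescale every channel of feature_map by its gate."""
    gates = excite(squeeze(feature_map), w1, b1, w2, b2)
    return [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]


def random_matrix(rows, cols, rng):
    """Helper for the demo: a rows x cols matrix with entries in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Hypothetical sizes: 4 channels, a 3x3 spatial grid, reduction ratio 2.
rng = random.Random(0)
channels, height, width, reduction = 4, 3, 3, 2
hidden_units = channels // reduction

feature_map = [random_matrix(height, width, rng) for _ in range(channels)]
w1 = random_matrix(hidden_units, channels, rng)   # random stand-in for learned weights
b1 = [0.0] * hidden_units
w2 = random_matrix(channels, hidden_units, rng)   # random stand-in for learned weights
b2 = [0.0] * channels

gates = excite(squeeze(feature_map), w1, b1, w2, b2)
recalibrated = se_block(feature_map, w1, b1, w2, b2)
```

The output keeps the shape of the input; only the relative magnitude of each channel changes. In a trained network the two fully connected layers are learned jointly with the detector, so the gates come to emphasize informative channels and suppress less useful ones.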
Acknowledgements
This work is partially supported by the Project of Guangxi Science and Technology (GuiKeAD20159041); the Research Fund of Guangxi Key Lab of Multi-source Information Mining & Security (No.20-A-01–01, No.20-A-01–02, MIMS21-M-01, MIMS20-M-01, MIMS20-04); the Innovation Project of Guangxi Graduate Education (YCSW2022124); the Guangxi “Bagui” Teams for Innovation and Research, China; and the Guangxi Collaborative Innovation Center of Multi-Source Information Integration and Intelligent Processing.
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tang, J., Zhang, W., Zhang, G., Liang, R., Lu, G. (2023). One-Stage Deep Channels Attention Network for Remote Sensing Images Object Detection. In: Li, B., Yue, L., Tao, C., Han, X., Calvanese, D., Amagasa, T. (eds) Web and Big Data. APWeb-WAIM 2022. Lecture Notes in Computer Science, vol 13422. Springer, Cham. https://doi.org/10.1007/978-3-031-25198-6_36
DOI: https://doi.org/10.1007/978-3-031-25198-6_36
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25197-9
Online ISBN: 978-3-031-25198-6
eBook Packages: Computer Science, Computer Science (R0)