Building detection algorithm in multi-scale remote sensing images based on attention mechanism

Ding, Wei; Zhang, Li; Yang, Guangliang

doi:10.1007/s12065-023-00849-9

Building detection algorithm in multi-scale remote sensing images based on attention mechanism

Special Issue
Published: 28 April 2023

Volume 16, pages 1717–1728, (2023)
Cite this article

Evolutionary Intelligence Aims and scope Submit manuscript

257 Accesses
Explore all metrics

Abstract

The seismic fortification intensity requirements of buildings of various heights are different. Identifying the heights of different buildings using a specific method can reduce secondary disasters such as building damage and house collapse caused by destructive earthquakes. The traditional building object detection model uses the feature expression of the dataset based on manually sketching the shape of the building in the remote sensing images, which will cause less acquired image information, unclear target features of buildings, and the decrease of target detection accuracy of identifying building types. The real-time target detection algorithm YOLO has fast detection speed and high accuracy, but its main disadvantage is the inaccurate positioning of the building bounding box and the misjudgment of partially overlapping building objects. Image feature retrieval using large data set domain adaptive fast algorithm is a new research hotspot in current image recognition. This research proposes a multi-branch network with the addition of a module using attention mechanism to coordinate the salient and sub-salient information of images. The algorithm adds channel and spatial attention mechanisms to the feature extraction network, and uses weighting and filtering methods to perform residual fusion on the original features vectors in images. The model is deployed on the remote sensing geographic information evaluation platform to conduct various building target detection experiments. The average detection accuracy of the urban buildings can reach 81.8%, and the detection speed for remote sensing segmentation images is 26 per second. The mAP@0.5 of the YOLOv5 algorithm with the attention mechanism is 12% higher than that of the unimproved YOLOv5 algorithm on the urban street view image dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Remote Sensing Image Detection Based on Attention Mechanism and YOLOv5

Arbitrary-oriented object detection via dense feature fusion and attention model for remote sensing super-resolution image

Article 07 May 2020

Visual attention model based mining area recognition on massive high-resolution remote sensing images

Article 15 March 2015

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Fang E, Qian J (1999) Issues of seismic resistant design for tall buildings in China. China Civil Engineering Journal 32:3–8
Google Scholar
Wang Y (2009) An overview of recent revision of seismic design codes after the 5.12 Wenchuan Earthquake. China Civil Engineering Journal 42(5):1–8. https://doi.org/10.1007/978-3-540-85168-452
Article Google Scholar
miao ZD, Ming WD (2021) Application of building spatial distribution method based on remote sensing image in earthquake disaster loss pre-assessment. Technol Earthq Disaster Prev 16:510–518
Google Scholar
Huo FF, Hou ML, Yang S (2019) Automatic extraction of building Rooftop outlines using airborne LiDAR: a review. Geomatics World 26:1–13
Google Scholar
Reite A, Kangas S, Steck Z (2019) Unsupervised Feature Learning in Remote Sensing, Computer Vision and Pattern Recognition, DOI: https://doi.org/10.1117/12.2529791
Bai SY, Agethen S, Chao TH, Hsu W (2015) Semi-supervised learning for convolutional neural networks via online graph construction, Computer Science, arXiv:1511.06104
Zang N, Cao Y, Wang Y (2021) Land-use mapping for high-spatial resolution remote sensing image via deep learning: a review. IEEE J Sel Top Appl Earth Observations Remote Sens 99(1):1–10
Google Scholar
Gong M, Zhou Z, Ma J (2012) Change detection in synthetic aperture radar images based on image fusion and fuzzy clustering. IEEE Trans Image Process 21(4):2141–2151
Article MathSciNet MATH Google Scholar
Gong M, Su L, Jia M (2014) Fuzzy clustering with a modified MRF energy function for change detection in synthetic aperture radar images. IEEE Trans Fuzzy Syst 22(1):98–109
Article Google Scholar
Lu-yang L, Jia Z-h, Yang J (2019) and K. Nikola, SAR image change detection using double difference images and PCA algorithm, Computer Engineering and Design, vol. 40 (7):5–15 DOI: https://doi.org/10.16208/j.issn10007024.2019.07.033
Wang Y, Ma H, Alifu K (2021) Remote sensing image description based on word embedding and end-to-end deep learning. Sci Rep 11(1):1–10
Google Scholar
Nogueira K, Penatti O, Santos J (2016) Towards Better exploiting Convolutional neural networks for remote sensing scene classification. Pattern Recogn 61(1):539–556
Google Scholar
Cheng G, Yang C, Yao X (2018) When deep learning meets metric learning: remote sensing image scene classification via learning discriminative CNNs. IEEE Trans Geosci Remote Sens 56(5):2811–2821
Article Google Scholar
Zhang Yy, Fei XY, Wang J (2020) Suvery of building extraction methods based on high resolution remote sensing images. Geomatics & Spatical Information Technology 43(4):76–79
Google Scholar
Liu J, Mao ZY (2009) A survey on segmentation techniques and application strategy of high spatial resolution remote sensing imagery. Remote Sens Inform 24(6):95–101. https://doi.org/10.3969/j.issn.10003177.2009.06.020
Article Google Scholar
Jung RSCR (2004) Rectangle detection based on a windowed hough Transform, in Brazilian Symposium on Computer Graphics & Image Processing (SIBGRAPI), 113–120
Kawahito K (2008) Recognition-driven two-dimensional competing priors toward automatic and accurate building detection. IEEE Trans Geoscience Remote Sens 47(1):133–144. https://doi.org/10.1109/tgrs.2008.2002027
Article Google Scholar
Zhao CX, Chen X, Yang L (2016) Optimization of aerial DSM with building boundary constraint. Remote Sens Inform 31(4):16–21. https://doi.org/10.3969/j.issn.1000-3177.2016.04.003
Article Google Scholar
Redmon J, Farhadi A (2018) YOLOv3: An incremental improvement, arXiv e-prints, Doi https://arxiv.org/pdf/1804.02767.pdf
Li S, Gu X, Xu X, Xu D, Dong Q (2021) Detection of concealed cracks from ground penetrating radar images based on deep learning algorithm. Constr Build Mater 273:121949. https://doi.org/10.1016/j.conbuildmat.2020.121949
Article Google Scholar
Cai Y, Luan T, Gao H (2021) YOLOv4-5D: an effective and efficient object detector for Autonomous Driving. IEEE Trans Instrum Meas 70(1):1–10. https://doi.org/10.1109/TIM.2021.3065438
Article Google Scholar
Rezatofighi H, Tsoi N, Gwak JY (2019) Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 1–10
Woo S, Park J, Lee J-Y, Kweon IS (2018) CBAM: Convolutional Block Attention Module, in 15th Computer Vision and Pattern Recognition (ECCV), Munich, Germany
Bottou Le, Curtis FE, Nocedal J (2016) Optimization methods for large-scale machine learning. Siam Rev 60(2):1–95. https://doi.org/10.1137/16M1080173
Article MathSciNet MATH Google Scholar
Rezende DJ, Mohamed S, Wierstra D (2014) Stochastic Backpropagation and Approximate Inference in Deep Generative Models, in 31st International Conference on Machine Learning (ICML), Beijing, China, 1–15
Wang G, Duan M, Niu C (2018) Stochastic gradient descent algorithm based on convolution neural network, Computer Engineering and Design, 39 (2): 441–445, DOI:https://doi.org/10.16208/j.issn1000-7024.2018.02.026
Xu F, Dong P, Wang H (2019) Intelligent detection and autonomous capture systemof seafood based on underwater robot. J Beijing Univ Aeronaut Astronaut 45(12):2393–2402. https://doi.org/10.13700/j.bh.1001-5965.2019.0377
Article Google Scholar
Xu X, Ma M, Thompson SG (2021) Intelligent co-detection of cyclists and motorcyclists based on an improved deep learning method. Meas Sci Technol 32(2):025402. https://doi.org/10.1088/13616501/abb745
Article Google Scholar
Alhichri H, Alsuwayed A, Y YB (2021) Classification of Remote sensing images using EfficientNet-B3 CNN model with attention. IEEE Access 9(1):14078–14094. https://doi.org/10.1109/ACCESS.2021.3051085
Article Google Scholar
Wang Q, Wu B, Zhu P ECA-net: Efficient channel attention for deep convolutional neural networks, in 2020 IEEE/CVF Conference on Computer Vision and, Recognition P (2020) (CVPR), Seattle, WA, USA, 1–10

Download references

Acknowledgements

This work is partially sponsored by funding from the National Natural Science Foundation of China [Grant No. 42174104, U1939204, 41204014], the Natural Science Foundation of Hubei province of China [Grant No.ZRMS2020000813], and the open fund of Wuhan gravitation and solid earth tides national observation and research station [No.WHYWZ202109].

Author information

Wei Ding contributed equally to this work.

Authors and Affiliations

Key Laboratory of Earthquake Geodesy, Institute of Seismology, CEA, Hongshan Ce Road, Wuhan, 430071, Hubei, China
Wei Ding & Li Zhang
Gravitation and Earth Tide, National Observation and Research Station, Wuhan, China
Wei Ding & Guangliang Yang
Hubei Earthquake Administration, Wuhan, 430071, China
Li Zhang & Guangliang Yang

Authors

Wei Ding
View author publications
You can also search for this author in PubMed Google Scholar
Li Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Guangliang Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li Zhang.

Ethics declarations

Conflict of interest

The authors declared that they have no conflicts of interest to this work. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Supplementary Material 4

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ding, W., Zhang, L. & Yang, G. Building detection algorithm in multi-scale remote sensing images based on attention mechanism. Evol. Intel. 16, 1717–1728 (2023). https://doi.org/10.1007/s12065-023-00849-9

Download citation

Received: 22 March 2022
Revised: 08 February 2023
Accepted: 06 April 2023
Published: 28 April 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s12065-023-00849-9

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Building detection algorithm in multi-scale remote sensing images based on attention mechanism

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Remote Sensing Image Detection Based on Attention Mechanism and YOLOv5

Arbitrary-oriented object detection via dense feature fusion and attention model for remote sensing super-resolution image

Visual attention model based mining area recognition on massive high-resolution remote sensing images

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Supplementary Material 4

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now