Abstract
Building extraction is very essential in various urban dynamics like disaster management and change detection, finding the estimated population, and so on. Building extraction from satellite data is a challenging task as the images may be subjected to different illumination or structure due to very large variations of the appearance of buildings which may correspond to the different area/terrain. Although satellite imagery is readily available from various sources, translating the imagery includes intensive effort. Many computer-vision tasks have been carried out successfully but understanding the impact of them on building extraction with remote sensing imagery is a growing need.To overcome this kind of problem, an algorithm is proposed which extends the convolutional neural network for pixel-wise classification of images. Furthermore, to resolve the problem of extraction and masking of images, Mask-RCNN (i.e., Mask Region-based Convolutional Neural Network) algorithm is used which makes this process easier and more efficient.The model is trained on a complex dataset that is significantly larger. Also, to make this algorithm more scalable, an advanced image augmentation technique is used in the pre-processing step.The results show that the algorithm achieves better performance in terms of accuracy.






Similar content being viewed by others
Data availability
The data samples have been taken using crowdAI.
Code availability
The relevant code with the manuscript is also avail- able and would be available, if will be asked to do so later.
References
Azimi SM et al (2019) Towards multi-class object detection in unconstrained remote sensing imagery. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11363 LNCS, pp 150–165. https://doi.org/10.1007/978-3-030-20893-6_10
Ardila JP et al (2012)Context-sensitive extraction of tree crown objects in urban areas using VHR satellite images. Int J Appl Earth Obs Geoinf. Elsevier B.V. 15(1):57–69. https://doi.org/10.1016/j.jag.2011.06.005
Akter S, Wamba SF (2016) ‘Big data analytics in E-commerce: a systematic review and agenda for future research. Electron Markets Electron Markets 26(2):173–194. https://doi.org/10.1007/s12525-016-0219-0
Chen K, Fu K, Gao X, Yan M, Sun X, Zhang H (2017) Building extraction from remote sensing images with deep learning in a supervised manner. In 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, pp 1672–1675
Cook K, Wright J (1975) Transmission Systems. 20(January 15, 1975), pp 219–223. https://doi.org/10.1049/pbte071e_ch4
Duan Y, Sun L (2019) Buildings extraction from remote sensing data using deep learning method based on improved U-Net network. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium. IEEE, pp 3959–3961
Gupta A, Anand R, Pandey D, Sindhwani N, Wairya S, Pandey BK, Sharma M (2021) Prediction of breast cancer using Extremely Randomized Clustering Forests (ERCF) Technique: Prediction of breast cancer. Int J Distrib Syst Technol (IJDST) 12(4):1–15
Ghiasi G, Fowlkes CC (2016) Laplacian pyramid reconstruction and refinement for semantic segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9907 LNCS, pp 519–534. https://doi.org/10.1007/978-3-319-46487-9_32
Güler RA, Neverova N, Kokkinos I (2018) DensePose: Dense human pose estimation in the wild. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 7297–7306. https://doi.org/10.1109/CVPR.2018.00762
Girshick R (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision. 2015 Inter, pp 1440–1448. https://doi.org/10.1109/ICCV.2015.169
Huang Z, Cheng G, Wang H, Li H, Shi L, Pan C (2016) Building extraction from multi-source remote sensing images via deep deconvolution neural networks. In: 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, pp 1835–1838
Huang X, Zhang L (2012) Morphological building/shadow index for building extraction from high-resolution imagery over urban areas. IEEE J Sel Top Appl Earth Obs Remote Sens 5(1):161–172. https://doi.org/10.1109/JSTARS.2011.2168195
He K et al (2017) Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision, 2017-Octob, pp 2980–2988. https://doi.org/10.1109/ICCV.2017.322
Hui J, Du M, Ye X, Qin Q, Sui J (2018) Effective building extraction from high-resolution remote sensing images with multitask driven deep neural network. IEEE Geosci Remote Sens Lett 16(5):786–790
Jin X, Davis CH (2005) Automated building extraction from high-resolution satellite imagery in Urban areas using structural, contextual, and spectral information. EURASIP J Appl Sig Process 2005(14):2196–2206. https://doi.org/10.1155/ASP.2005.2196
Liu Z et al (2015) ‘Deep learning face attributes in the wild. Proceedings of the IEEE International Conference on Computer Vision, 2015 Inter, pp 3730–3738. https://doi.org/10.1109/ICCV.2015.425
Liu Y, Zhou J, Qi W, Li X, Gross L, Shao Q, ... Li Z (2020) ARC-Net: an efficient network for building extraction from high-resolution aerial images. IEEE Access 8:154997–155010
Li W et al (2006) A novel framework for urban change detection using VHR satellite images. Proceedings - International Conference on Pattern Recognition 2, pp 312–315. https://doi.org/10.1109/ICPR.2006.138
Li X, Yao X, Fang Y (2018) Building-a-nets: Robust building extraction from high-resolution remote sensing images with adversarial networks. IEEE J Sel Top Appl Earth Obs Remote Sens 11(10):3680–3687
Liu Y, Gross L, Li Z, Li X, Fan X, Qi W (2019) Automatic building extraction on high-resolution remote sensing imagery using deep convolutional encoder-decoder with spatial pyramid pooling. IEEE Access 7:128774–128786
Marmanis D, Adam F, Datcu M, Esch T, Stilla U (2015) Deep neural networks for above-ground detection in very high spatial resolution digital elevation models. ISPRS Ann Photogramm Remote Sens Spat Inf Sci 2(3):103
Majd RD, Momeni M, Moallem P (2019) Transferable object-based framework based on deep convolutional neural networks for building extraction. IEEE J Sel Top Appl Earth Observations Remote Sens 12(8):2627–2635
Meivel S, Sindhwani N, Anand R, Pandey D, Alnuaim AA, Altheneyan AS, ... Lelisho ME (2022) Mask detection and social distance identification using internet of things and faster R-CNN algorithm. Comput Intell Neurosci 2022:2103975. https://doi.org/10.1155/2022/2103975
Madhumathy P, Pandey D (2022) Deep learning based photo acoustic imaging for non-invasive imaging. Multimed Tools Appl. https://doi.org/10.1007/s11042-022-11903-6
Pandey D, Pandey BK (2022) An efficient deep neural network with adaptive galactic swarm optimization for complex image text extraction. Process Mining Techniques for Pattern Recognition. CRC Press, pp 121–137
Pandey BK, Pandey D, Wariya S, Aggarwal G, Rastogi R (2021) Deep learning and particle swarm optimisation-based techniques for visually impaired humans’ text recognition and identification. Augmented Hum Res 6(1):1–14
Pal SK, Mitra S (1992) Multilayer perceptron, fuzzy sets, and classification. IEEE Trans Neural Netw 3(5):683–697. https://doi.org/10.1109/72.159058
Pandey BK, Mane D, Nassa VKK, Pandey D, Dutta S, Ventayen RJM, ... Rastogi R (2021) Secure text extraction from complex degraded images by applying steganography and deep learning. In: Multidisciplinary Approach to Modern Digital Steganography. IGI Global, pp 146-163
Pandey BK, Pandey D, Wariya S, Agarwal G (2021) A deep neural network-based approach for extracting textual images from deteriorate images. EAI Endorsed Trans Ind Netw Intell Syst 8(28):e3
Papadomanolaki M et al (2016) Benchmarking deep learning frameworks for the classification of very high resolution satellite multispectral data. ISPRS Ann Photogramm Remote Sens Spat Inf Sci III–7(July):83–88. https://doi.org/10.5194/isprsannals-iii-7-83-2016
Pandey D, Pandey B, Wairya S (2021) Hybrid deep neural network with adaptive galactic swarm optimization for text extraction from scene images. Soft Comput 25:1563–1580. https://doi.org/10.1007/s00500-020-05245-4
Sindhwani N, Verma S, Bajaj T, Anand R (2021) Comparative analysis of intelligent driving and safety assistance systems using YOLO and SSD model of deep learning. Int J Inform Syst Model Des (IJISMD) 12(1):131–146
Vakalopoulou M et al (2015) Building detection in very high resolution multispectral data with deep learning features. International Geoscience and Remote Sensing Symposium (IGARSS), 2015-Novem, pp 1873–1876. https://doi.org/10.1109/IGARSS.2015.7326158
Wang Y, Gu L, Li X, Ren R (2020) Building extraction in multitemporal high-resolution remote sensing imagery using a multifeature LSTM network.IEEE Geosci Remote Sensing Lett. https://doi.org/10.1109/LGRS.2020.3005018
Wang M, Yuan S, Pan J (2013) Building detection in high resolution satellite urban image using segmentation, corner detection combined with adaptive windowed Hough Transform. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote. Geoscience and Remote Sensing Symposium (IGARSS), 2013 IEEE International, pp 508–511. https://doi.org/10.1109/IGARSS.2013.6721204
Wang X, Shrivastava A, Gupta A (2017) A-Fast-RCNN: Hard positive generation via adversary for object detection. Proceedings – 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, 2017-Janua, pp 3039–3048. https://doi.org/10.1109/CVPR.2017.324
Xu Y et al (2018) Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. Remote Sens 10(1). https://doi.org/10.3390/rs10010144
Yuan J (2016) Automatic building extraction in aerial scenes using convolutional networks. arXiv preprint arXiv:1602.06564
Singh SK, Thakur RK, Kumar S, Anand R (2022, March) Deep Learning and Machine Learning based Facial Emotion Detection using CNN. In 2022 9th International Conference on Computing for Sustainable Global Development (INDIACom) (pp 530–535). IEEE
Saini P, Anand MR (2014) Identification of Defects in Plastic Gears Using Image Processing and Computer Vision: A Review. Int J Eng Res 3(2):94–99
Acknowledgements
The authors would like to express gratitude to Department of Technical Education and Panipat Institute of Engineering & Technology, Panipat, India. The authors would also like to thank to Vice Chancellor, Dr. A.P.J. Abdul Kalam Technical University, and Uttar Pradesh, India.
Author information
Authors and Affiliations
Contributions
All authors approve the final manuscript.
Corresponding author
Ethics declarations
Ethics approval
Not Applicable (as the results of studies does not involve any human or animal).
Consent to participate
Not Applicable (as the results of studies does not involve any human or animal).
Consent for publication
Not Applicable (as the results of studies does not involve any human or animal).
Conflict of interest / Competing interests
The authors declare that they have ‘no known conflict of interests or personal relationships’ that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Raghavan, R., Verma, D.C., Pandey, D. et al. Optimized building extraction from high-resolution satellite imagery using deep learning. Multimed Tools Appl 81, 42309–42323 (2022). https://doi.org/10.1007/s11042-022-13493-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13493-9