Skip to main content

Advertisement

Log in

Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework

  • Research
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Building footprints are the most visible features in urban areas. Detecting building footprint has a substantial position in decision-making problems such as city planning and development, urban mapping and management, population estimation, etc. In this paper, we aim to automatically detect and count building footprints by leveraging deep learning techniques and the potential availability of remote sensing datasets at high spatial resolutions. We build object detection deep learning models by integrating the partitioning segmentation and convolutional neural networks (CNN) architecture using the WorldView-3 very high-resolution optical satellite imagery data. Our detection model is implemented on data from the Paris, France, and Khartoum, Sudan regions. These regions were chosen because it has very different characteristics and large data variation, making it challenging to extract building features from satellite imagery. This challenge is even greater because one area is taken perpendicularly and the other area is taken at an angle of several degrees which causes more noise, one of which is the shadow of the building. The detection models can detect and count the building footprints and have promising performance on large variable data with an Average Precision (AP) of 64.19%. Paris was detected better by 74.66% compared to Khartoum by 56.19%. The partitioning segmentation technique is used to tune the anchor boxes of CNN input images. Our result offers a potential practical implementation for rapid yet accurate estimation of urban monitoring and city planning, particularly in metropolitan regions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Data availability

Data derived from public domain resources, i.e. https://spacenet.ai/datasets/.

References

  • Afira N, Wijayanto AW (2022) Mono-temporal and multi-temporal approaches for burnt area detection using Sentinel-2 satellite imagery (a case study of Rokan Hilir Regency, Indonesia). Ecol Inform 69:101677

    Article  Google Scholar 

  • Agarap AF (2018) Deep learning using rectified linear units (ReLU). arXiv: 1803.08375. https://doi.org/10.48550/arXiv.1803.08375

  • Agarwal S, Terrail JO du, Jurie F (2018) Recent advances in object detection in the age of deep convolutional neural networks. arXiv: 1809.03193. https://doi.org/10.48550/arXiv.1809.03193

  • Alganci U, Soydas M, Sertel E (2020) Comparative Research on Deep Learning Approaches for Airplane Detection from Very High-Resolution Satellite Images. Remote Sens (Basel) 12, 458

  • Alidoost F, Arefi H (2018) A CNN-Based Approach for Automatic Building Detection and Recognition of Roof Types Using a Single Aerial Image. PFG – J Photogramm Remote Sens Geoinform Sci 86, 235–248

  • Awrangjeb M, Fraser C (2014) Automatic Segmentation of Raw LIDAR Data for Extraction of Building Roofs. Remote Sens (basel) 6:3716–3751

    Article  Google Scholar 

  • Belgiu M, Drǎguţ L (2014) Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery. ISPRS J Photogramm Remote Sens 96:67–75

    Article  Google Scholar 

  • Benali A, Dermeche H, Belhadj S, Adnane A, Hanifi Elhachemi Amar R (2014) Buildings extraction of very high spatial resolution satellite images. in 2014 International Conference on Multimedia Computing and Systems (ICMCS) 277–282 (IEEE). https://doi.org/10.1109/ICMCS.2014.6911229.

  • Bochkovskiy A (2016) darknet. https://github.com/AlexeyAB

  • Bonczak B, Kontokosta CE (2019) Large-scale parameterization of 3D building morphology in complex urban landscapes using aerial LiDAR and city administrative data. Comput Environ Urban Syst 73:126–142

    Article  Google Scholar 

  • Chartock E, LaRow W, Singh V (2017) Extraction of building footprints from satellite imagery. Stanford University Report. http://vision.stanford.edu/teaching/cs231n/reports/2017/pdfs/550.pdf. Accessed 1 Mar 2022

  • Cheng L, Gong J, Li M, Liu Y (2011) 3D Building Model Reconstruction from Multi-view Aerial Imagery and Lidar Data. Photogramm Eng Remote Sens 77:125–139

    Article  Google Scholar 

  • Das SK, PS P, Aithal B (2018) Automated Building Extraction using High Resolution Satellite Imagery though Ensemble Modelling and Machine Learning. Remote Sensing of Land 2, 31–46

  • Dorninger P, Pfeifer N (2008) A Comprehensive Automated 3D Approach for Building Extraction, Reconstruction, and Regularization from Airborne Laser Scanning Point Clouds. Sensors 8:7323–7343

    Article  Google Scholar 

  • Gao J, Chen Y, Wei Y, Li J (2021) Detection of Specific Building in Remote Sensing Images Using a Novel YOLO-S-CIOU Model. Case: Gas Station Identification. Sensors 21, 1375

  • Girshick R, Donahue J, Darrell T, Malik J (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. in 2014 IEEE Conference on Computer Vision and Pattern Recognition 580–587 (IEEE). https://doi.org/10.1109/CVPR.2014.81

  • Guo M, Liu H, Xu Y, Huang Y (2020) Building Extraction Based on U-Net with an Attention Block and Multiple Losses. Remote Sens (Basel) 12:1400

    Article  Google Scholar 

  • Haala N, Brenner C (1999) Extraction of buildings and trees in urban environments. ISPRS J Photogramm Remote Sens 54:130–137

    Article  Google Scholar 

  • Han Q, Yin Q, Zheng X, Chen Z (2021) Remote sensing image building detection method based on Mask R-CNN. Complex IntellSyst https://doi.org/10.1007/s40747-021-00322-z

  • Huang X, Zhang L (2012) Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas. IEEE J Sel Top Appl Earth Obs Remote Sens 5:161–172

    Article  Google Scholar 

  • Ide H, Kurita T (2017) Improvement of learning for CNN with ReLU activation by sparse regularization. in 2017 International Joint Conference on Neural Networks (IJCNN) 2684–2691 (IEEE). https://doi.org/10.1109/IJCNN.2017.7966185

  • Ioffe S, Szegedy, C (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR abs/1502.03167

  • Johnson M. et al. (2016) Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. CoRR abs/1611.04558

  • Joshi KK, Kono T (2009) Optimization of floor area ratio regulation in a growing city. Reg Sci Urban Econ 39:502–511

    Article  Google Scholar 

  • Khoshboresh-Masouleh M, Shah-Hosseini R (2021) A Deep Multi-Modal Learning Method and a New RGB-Depth Data Set for Building Roof Extraction. Photogramm Eng Remote Sensing 87:759–766

    Article  Google Scholar 

  • Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet Classification with Deep Convolutional Neural Networks. in Advances in Neural Information Processing Systems (eds. Pereira, F., Burges, C. J. C., Bottou, L. & Weinberger, K. Q.) vol. 25 (Curran Associates, Inc.)

  • Liu L et al (2020) Deep Learning for Generic Object Detection: A Survey. Int J Comput vis 128:261–318

    Article  Google Scholar 

  • Liu L et al (2021) Learning region-guided scale-aware feature selection for object detection. Neural Comput Appl 33:6389–6403

    Article  Google Scholar 

  • Liu W et al. (2016) SSD: Single Shot MultiBox Detector. in 21–37. https://doi.org/10.1007/978-3-319-46448-0_2

  • Maas HG, Vosselman G (1999) Two algorithms for extracting building models from raw laser altimetry data. ISPRS J Photogramm Remote Sens 54:153–163

    Article  Google Scholar 

  • Monna F. et al. (2021) Deep learning to detect built cultural heritage from satellite imagery. Spatial distribution and size of vernacular houses in Sumba, Indonesia.  J Cult Herit 52, 171–183

  • Nurmasari Y, Wijayanto AW (2021) Oil Palm Plantation Detection in Indonesia Using Sentinel-2 and Landsat-8 Optical Satellite Imagery (Case Study: Rokan Hulu Regency, Riau Province). Intl J Remote Sens Earth Sci (IJReSES) 18:1–18. https://doi.org/10.30536/j.ijreses.2021.v18.a3537

  • Park Y, Guldmann JM (2019) Creating 3D city models with building footprints and LIDAR point cloud classification: A machine learning approach. Comput Environ Urban Syst 75:76–89

    Article  Google Scholar 

  • Park Y, Guldmann JM, Liu D (2021) Impacts of tree and building shades on the urban heat island: Combining remote sensing, 3D digital city and spatial regression approaches. Comput Environ Urban Syst 88:101655

    Article  Google Scholar 

  • Pham M-T, Courtrai L, Friguet C, Lefèvre S, Baussard A (2020) YOLO-Fine: One-Stage Detector of Small Objects Under Various Backgrounds in Remote Sensing Images. Remote Sens (Basel) 12, 2501

  • Putri SR, Wijayanto AW, Sakti AD (2022) Developing Relative Spatial Poverty Index Using Integrated Remote Sensing and Geospatial Big Data Approach: A Case Study of East Java Indonesia. ISPRS Int J Geoinf 11:275

    Article  Google Scholar 

  • Putri SR, Wijayanto AW (2022) Learning Bayesian Network for Rainfall Prediction Modeling in Urban Area using Remote Sensing Satellite Data (Case Study: Jakarta, Indonesia). in International Conference on Data Science and Official Statistics 77–90

  • Redmon J, Farhadi A (2018) YOLOv3: an incremental improvement. arXiv:1804.02767. https://doi.org/10.48550/arXiv.1804.02767

  • Redmon J, Divvala S, Girshick R, Farhadi A (2015) You only look once: unified, real-time object detection. arXiv:1506.02640. https://doi.org/10.48550/arXiv.1506.02640

  • Ren Y, Zhu C, Xiao S (2018) Small Object Detection in Optical Remote Sensing Images via Modified Faster R-CNN. Appl Sci 8:813

    Article  Google Scholar 

  • Rongshuang F, Yang C, Qiheng X, Jingxue W (2019) A high-resolution remote sensing image building extraction method based on deep learning. Acta Geodaetica Et Cartographica Sinica 48:34

    Google Scholar 

  • Rylatt M, Gadsden S, Lomas K (2001) GIS-based decision support for solar energy planning in urban environments. Comput Environ Urban Syst 25:579–603

    Article  Google Scholar 

  • Saadi TDT, Wijayanto AW (2021) Machine learning applied to sentinel-2 and landsat-8 multispectral and medium-resolution satellite imagery for the detection of rice production areas in Nganjuk, East Java, Indonesia. Intl J Remote Sens Earth Sci (IJReSES) 18:19. https://doi.org/10.30536/j.ijreses.2021.v18.a3538

  • Singh G, Jouppi M, Zhang Z, Zakhor A (2015) Shadow based building extraction from single satellite image. in (eds. Bouman, C. A. & Sauer, K. D.) 94010F. doi:https://doi.org/10.1117/12.2083500

  • SpaceNet on Amazon Web Services (AWS) (2018) ‘Datasets.’ The SpaceNet Catalog. https://spacenet.ai/datasets/. Accessed 12 Feb 2022

  • Tian D, Han Y, Wang B, Guan T, Gu H, Wei W (2021) Review of object instance segmentation based on deep learning. J Electron Imaging 31. https://doi.org/10.1117/1.JEI.31.4.041205

  • Triscowati DW, Sartono B, Kurnia A, Domiri DD, Wijayanto AW (2019) Multitemporal remote sensing data for classification of food crops plant phase using supervised random forest. in Sixth Geoinformation Science Symposium (eds. Wibowo, S. B. et al.) 10 (SPIE). https://doi.org/10.1117/12.2547216

  • Triscowati DW, Sartono B, Kurnia A, Dirgahayu D, Wijayanto AW (2020) Classification of rice-plant growth phase using supervised random forest method based on landsat-8 multitemporal data. Intl J Remote Sens Earth Sci (IJReSES) 16:187. https://doi.org/10.30536/j.ijreses.2019.v16.a3217

  • van Etten A, Lindenbaum D, Bacastow TM (2018) SpaceNet: a remote sensing dataset and challenge series. arXiv:1807.01232. https://doi.org/10.48550/arXiv.1807.01232

  • Wen Q et al (2019) Automatic Building Extraction from Google Earth Images under Complex Backgrounds Based on Deep Instance Segmentation Network. Sensors 19:333

    Article  Google Scholar 

  • Wijayanto AW, Wahyu Triscowati D, Marsuhandi AH (2020) Maize field area detection in East Java, Indonesia: An integrated multispectral remote sensing and machine learning approach. in 2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE) 168–173 (IEEE). https://doi.org/10.1109/ICITEE49829.2020.9271683

  • Wijayanto AW, Afira N, Nurkarim W (2022) Machine Learning Approaches using Satellite Data for Oil Palm Area Detection in Pekanbaru City, Riau. in 2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom) 84–89 (IEEE). https://doi.org/10.1109/CyberneticsCom55287.2022.9865301

  • Xie Y, Cai J, Bhojwani R, Shekhar S, Knight J (2020) A locally-constrained YOLO framework for detecting small and densely-distributed building footprints. Int J Geogr Inf Sci 34:777–801

    Article  Google Scholar 

  • Xiong W. et al. (2018) The Microsoft 2017 Conversational Speech Recognition System. in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 5934–5938 (IEEE). https://doi.org/10.1109/ICASSP.2018.8461870.

Download references

Acknowledgements

The authors are grateful to acknowledge the support from Statistics Indonesia (BPS) and Politeknik Statistika STIS, and would like to thank for Van Etten and DigitalGlobe for providing the satellite imagery very high-resolution WorldView-3 for the building footprint mapping challenge (SpaceNet at Amazon Web Services (AWS). “Datasets.” The SpaceNet Catalog. Last modified October 1st, 2018. Accessed on [December 18th, 2021]. https://spacenet.ai/datasets/)

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception, design and analysis. Data collection, experimental simulations, and writing of original draft preparation were performed by Wahidya Nurkarim. Selecting the methodology, supervising the project, review and editing the manuscript were performed by Arie Wahyu Wijayanto. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Arie Wahyu Wijayanto.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Communicated by: H. Babaie

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nurkarim, W., Wijayanto, A.W. Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework. Earth Sci Inform 16, 515–532 (2023). https://doi.org/10.1007/s12145-022-00895-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-022-00895-4

Keywords

Navigation