Deep Learning Models for Architectural Façade Detection in Spherical Images

Kutrzyński, Marcin; Żak, Bartosz; Telec, Zbigniew; Trawiński, Bogdan

doi:10.1007/978-3-030-88081-1_40

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12876))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1516 Accesses

Abstract

Maintaining a public register by extracting urban objects from photos taken in the city is one of the most important tasks for municipal services. It is of great importance in the field of protection and shaping of the cultural landscape, protection of monuments, and registration of the urban tissue development. The current state of the art shows that deep learning models (DL models) can cope with the problem of extracting urban objects with the same or better performance than non-DL models, and can process video and photos automatically. This paper compares the three main DL models for facade instance detection and facade segmentation: Mask R-CNN, YOLACT, and Mask-Scoring R-CNN. The training and validation datasets used for transfer learning were created on the basis of spherical photos taken in an artificially generated virtual city. The test dataset, on the other hand, included spherical façade photos taken in a real city. The comparative analysis of the DL models was performed using parametric and nonparametric statistical tests for pairwise and multiple comparisons.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

An Approach to Automatic Detection of Architectural Façades in Spherical Images

Deep Learning Methods for Extracting Object-Oriented Models of Building Interiors from Images

Convolutional Neural Network for Detection of Building Contours Using Multisource Spatial Data

References

Caffe2 model zoo. https://caffe2.ai/docs/zoo.html. Accessed 6 Aug 2021
Pyprt - python bindings for cityengine sdk. https://github.com/Esri/pyprt. Accessed 6 Aug 2021
scikit-posthocs. https://scikit-posthocs.readthedocs.io/. Accessed 6 Aug 2021
Vgg image annotator (via). https://www.robots.ox.ac.uk/~vgg/software/via/via.html. Accessed 6 Aug 2021
py360convert. https://github.com/sunset1995/py360convert (2020). [Online; Accessed 15 Jan 2021
Bolya, D., Zhou, C., Xiao, F., Lee, Y.J.: YOLACT: real-time instance segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9157–9166 (2019)
Google Scholar
Chen, X., Girshick, R., He, K., Dollár, P.: TensorMask: a foundation for dense object segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2061–2069 (2019)
Google Scholar
Girshick, R.: Fast r-cnn. CoRR abs/1504.08083 (2015). http://www.cv-foundation.org/openaccess/content_iccv_2015/papers/Girshick_Fast_R-CNN_ICCV_2015_paper.pdf
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017). https://doi.org/10.1109/ICCV.2017.322
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
Hernández, J., Marcotegui, B.: Morphological segmentation of building façade images. In: 2009 16th IEEE International Conference on Image Processing (ICIP), pp. 4029–4032 (2009). https://doi.org/10.1109/ICIP.2009.5413756
Huang, Z., Huang, L., Gong, Y., Huang, C., Wang, X.: Mask scoring R-CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6409–6418 (2019)
Google Scholar
Kutrzyński, M., Żak, B., Telec, Z., Trawiński, B.: An approach to automatic detection of architectural façades in spherical images. In: Intelligent Information and Database Systems: 13th Asian Conference, ACIIDS 2021, Phuket, Thailand, 7–10 April 2021, Proceedings 13, pp. 494–504. Springer (2021). https://doi.org/10.1007/978-3-030-73280-6_39
Lin, T.Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar
Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(Oct), 2825–2830 (2011)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2016)
Article Google Scholar
Sümer, E., Türker, M.: An automatic region growing based approach to extract facade textures from single ground-level building images. J. Geodesy Geoinf. 2(1), 9–17 (2013)
Article Google Scholar
Wendel, A., Donoser, M., Bischof, H.: Unsupervised facade segmentation using repetitive patterns, vol. 6376, pp. 51–60 (2010). https://doi.org/10.1007/978-3-642-15986-2_6

Download references

Author information

Authors and Affiliations

Department of Applied Informatics, Wrocław University of Science and Technology, Wrocław, Poland
Marcin Kutrzyński, Bartosz Żak, Zbigniew Telec & Bogdan Trawiński

Authors

Marcin Kutrzyński
View author publications
You can also search for this author in PubMed Google Scholar
Bartosz Żak
View author publications
You can also search for this author in PubMed Google Scholar
Zbigniew Telec
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Trawiński
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bogdan Trawiński .

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
Democritus University of Thrace, Kimmeria, Xanthi, Greece
Lazaros Iliadis
University of Piraeus, Piraeus, Greece
Ilias Maglogiannis
Wrocław University of Science and Technology, Wrocław, Poland
Bogdan Trawiński

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kutrzyński, M., Żak, B., Telec, Z., Trawiński, B. (2021). Deep Learning Models for Architectural Façade Detection in Spherical Images. In: Nguyen, N.T., Iliadis, L., Maglogiannis, I., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2021. Lecture Notes in Computer Science(), vol 12876. Springer, Cham. https://doi.org/10.1007/978-3-030-88081-1_40

Download citation

DOI: https://doi.org/10.1007/978-3-030-88081-1_40
Published: 30 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88080-4
Online ISBN: 978-3-030-88081-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics