skip to main content
10.1145/3615886.3627736acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Self-Attention Driven Decoder for SAR Image-based Semantic Flood Zone Segmentation

Authors Info & Claims
Published:20 November 2023Publication History

ABSTRACT

Floods are destructive natural calamities that endanger people's lives, infrastructure, and the environment. Flood detection that is timely and accurate can help with disaster management and save lives. Flood semantic segmentation from remote sensing data such as SAR images has gained popularity due to recent advances in computer and memory capacity. In this context, encoder-decoder based CNN architectures are widely adopted. However, the inter-class feature sharing in these images makes distinguishing flood-prone zones challenging. To properly extract features and decode class labels, robust encoders and decoders are necessary. A common strategy that dramatically upsamples the decoder's feature maps, in particular, typically causes information loss and gives subpar segmentation results. In this context, the current work proposes a novel decoder that uses a self-attention layer to improve the feature maps before assigning class labels. The proposed method has been statistically and qualitatively verified using publicly available dataset.

References

  1. Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. 2017. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence 39, 12 (2017), 2481--2495.Google ScholarGoogle Scholar
  2. Derrick Bonafilia, Beth Tellman, Tyler Anderson, and Erica Issenberg. 2020. Sen1Floods11: A Georeferenced Dataset to Train and Test Deep Learning Flood Algorithms for Sentinel-1. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.Google ScholarGoogle ScholarCross RefCross Ref
  3. Eduardo A Carvalho, Daniela M Ushizima, Fátima NS Medeiros, Charles Iury Oliveira Martins, Regis CP Marques, and Isaura NS Oliveira. 2010. SAR imagery segmentation by statistical region growing and hierarchical merging. Digital Signal Processing 20, 5 (2010), 1365--1378.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Mahaman Sani Chaibou, Karim Kalti, Basel Solaiman, and Mohamed Ali Mahjoub. 2016. Image segmentation by contextual region growing based on fuzzy classification. In 2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP). IEEE, 489--493.Google ScholarGoogle ScholarCross RefCross Ref
  5. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834--848.Google ScholarGoogle Scholar
  6. Ciro D'Elia, Simona Ruscino, Maurizio Abbate, Bruno Aiazzi, Stefano Baronti, and Luciano Alparone. 2014. SAR image classification through information-theoretic textural features, MRF segmentation, and object-oriented learning vector quantization. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 7, 4 (2014), 1116--1126.Google ScholarGoogle ScholarCross RefCross Ref
  7. Alberto Garcia-Garcia, Sergio Orts-Escolano, Sergiu Oprea, Victor Villena-Martinez, Pablo Martinez-Gonzalez, and Jose Garcia-Rodriguez. 2018. A survey on deep learning techniques for image and video semantic segmentation. Applied Soft Computing 70 (2018), 41--65.Google ScholarGoogle ScholarCross RefCross Ref
  8. Jong-Sen Lee. 1980. Digital image enhancement and noise filtering by use of local statistics. IEEE transactions on pattern analysis and machine intelligence 2 (1980), 165--168.Google ScholarGoogle ScholarCross RefCross Ref
  9. Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3431--3440.Google ScholarGoogle ScholarCross RefCross Ref
  10. Sandro Martinis, André Twele, and Stefan Voigt. 2010. Unsupervised extraction of flood-induced backscatter changes in SAR data using Markov image modeling on irregular graphs. IEEE Transactions on Geoscience and Remote Sensing 49, 1 (2010), 251--263.Google ScholarGoogle ScholarCross RefCross Ref
  11. Ozan Oktay, Jo Schlemper, Loic Le Folgoc, Matthew Lee, Mattias Heinrich, Kazunari Misawa, Kensaku Mori, Steven McDonagh, Nils Y Hammerla, Bernhard Kainz, et al. 2018. Attention u-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018).Google ScholarGoogle Scholar
  12. A Kai Qin and David A Clausi. 2010. Multivariate image segmentation using semantic region growing with adaptive edge penalty. IEEE transactions on image processing 19, 8 (2010), 2157--2170.Google ScholarGoogle Scholar
  13. Xuebin Qin, Zichen Zhang, Chenyang Huang, Masood Dehghan, Osmar R Zaiane, and Martin Jagersand. 2020. U2-Net: Going deeper with nested U-structure for salient object detection. Pattern recognition 106 (2020), 107404.Google ScholarGoogle Scholar
  14. Masoumeh Rahmani and Gholamreza Akbarizadeh. 2015. Unsupervised feature learning based on sparse coding and spectral clustering for segmentation of synthetic aperture radar images. IET Computer Vision 9, 5 (2015), 629--638.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Anushree Rambhad, Dhirendra Pratap Singh, and Jaytrilok Choudhary. 2023. Detection of Flood Events from Satellite Images Using Deep Learning. In Intelligent Data Engineering and Analytics: Proceedings of the 10th International Conference on Frontiers in Intelligent Computing: Theory and Applications (FICTA 2022). Springer, 259--268.Google ScholarGoogle ScholarCross RefCross Ref
  16. Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention--MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, 234--241.Google ScholarGoogle Scholar
  17. Fan Wang, Yan Wu, Ming Li, Peng Zhang, and Qingjun Zhang. 2016. Adaptive hybrid conditional random field model for SAR image segmentation. IEEE Transactions on Geoscience and Remote Sensing 55, 1 (2016), 537--550.Google ScholarGoogle ScholarCross RefCross Ref
  18. Ritu Yadav, Andrea Nascetti, and Yifang Ban. 2022. Attentive Dual Stream Siamese U-Net for Flood Detection on Multi-Temporal Sentinel-1 Data. In IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium. 5222--5225. https://doi.org/10.1109/IGARSS46834.2022.9883132Google ScholarGoogle ScholarCross RefCross Ref
  19. Wen Yang, Xun Zhang, Lijun Chen, and Hong Sun. 2010. Semantic segmentation of polarimetric SAR imagery using conditional random fields. In 2010 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 1593--1596.Google ScholarGoogle ScholarCross RefCross Ref
  20. Xiwen Yao, Junwei Han, Lei Guo, Shuhui Bu, and Zhenbao Liu. 2015. A coarse-to-fine model for airport detection from remote sensing images using target-oriented visual saliency and CRF. Neurocomputing 164 (2015), 162--172.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Peter Yu, A Kai Qin, and David A Clausi. 2011. Unsupervised polarimetric SAR image segmentation and classification using region growing with edge penalty. IEEE Transactions on Geoscience and Remote Sensing 50, 4 (2011), 1302--1317.Google ScholarGoogle ScholarCross RefCross Ref
  22. Tianwen Zhang, Xiaoling Zhang, Jianwei Li, and Jun Shi. 2022. Contextual Squeeze-and-Excitation Mask R-CNN for SAR Ship Instance Segmentation. In 2022 IEEE Radar Conference (RadarConf22). 1--6. https://doi.org/10.1109/RadarConf2248738.2022.9764228Google ScholarGoogle ScholarCross RefCross Ref
  23. Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and Jianming Liang. 2018. Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4. Springer, 3--11.Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    GeoAI '23: Proceedings of the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
    November 2023
    135 pages
    ISBN:9798400703485
    DOI:10.1145/3615886

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 20 November 2023

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate17of25submissions,68%
  • Article Metrics

    • Downloads (Last 12 months)32
    • Downloads (Last 6 weeks)4

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader