Skip to main content

CCU-NET: CBAM and Cascaded Edge Detection Optimization U-NET for Remote Sensing Image Segmentation

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2024)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2215))

  • 168 Accesses

Abstract

U-Net has been widely applied in semantic segmentation tasks, but it faces challenges in the semantic segmentation of high-resolution remote sensing images due to the loss of boundary information during the downsampling process and the inherent blurriness of object boundaries in remote sensing images. We propose an advanced U-Net variant model that addresses these issues. By introducing the CBAM attention mechanism, we enhance the extraction of boundary information during the downsampling process, and by incorporating a cascaded edge detection module, we significantly improve the model's boundary segmentation performance. As a result, the model demonstrates excellent performance in the segmentation of high-resolution remote sensing images. The results indicate that our proposed model outperforms other baseline models and exhibits superior performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Boulila, W., Farah, I.R., Ettabaa, K.S., et al.: Spatio-temporal modeling for knowledge discovery in satellite image databases, pp. 35–49 (2010)

    Google Scholar 

  2. Boulila, W.: A top-down approach for semantic segmentation of big remote sensing images 12(3), 295–306 (2019)

    Google Scholar 

  3. Zhou, Z., Zhang, J., Gong, C.: Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network 38(17), 2491–2510 (2023)

    Google Scholar 

  4. Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation 39(12), 2481–2495 (2017)

    Google Scholar 

  5. Zhang, S., Zhang, C.: Modified U-Net for plant diseased leaf image segmentation 204, 107511 (2023)

    Google Scholar 

  6. Abdollahi, A., Pradhan, B., Alamri, A.M.: An ensemble architecture of deep convolutional Segnet and Unet networks for building semantic segmentation from high-resolution aerial images 37(12), 3355–3370 (2022)

    Google Scholar 

  7. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440. Institute of Electrical and Electronics Engineers (IEEE) (2015)

    Google Scholar 

  8. Zhang, D., Zhang, L., Tang, J.: Augmented FCN: rethinking context modeling for semantic segmentation 66(4), 142105 (2023)

    Google Scholar 

  9. Fifty, C., Amid, E., Zhao, Z., et al.: Efficiently identifying task groupings for multi task learning 34, 27503–27516 (2021)

    Google Scholar 

  10. Samant, R.M., Bachute, M.R., Gite, S., et al.: Framework for deep learning-based language models using multi task learning in natural language understanding: a systematic literature review and future directions 10, 17078–17097 (2022)

    Google Scholar 

  11. Chen, J., Zhu, D., Shen, X., et al.: Minigpt-v2: large language model as a unified interface for vision-language multi task learning 2310, 09478 (2023)

    Google Scholar 

  12. Cheng, G., Wang, Y., Xu, S., et al.: Automatic road detection and centerline extraction via cascaded end-to-end convolutional neural network 55(6), 3322–3337 (2017)

    Google Scholar 

  13. Murugesan, B., Sarveswaran, K., Shankaranarayana, S.M., et al.: Conv-MCD: a plug-and-play multi task module for medical image segmentation. In: Machine Learning in Medical Imaging: 10th International Workshop, pp. 292–300. Springer International (2019)

    Google Scholar 

  14. Murugesan, B., Sarveswaran, K., Shankaranarayana, S.M., et al.: Psi-Net: shape and boundary aware joint multi task deep network for medical image segmentation. In: 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 7223–7226. Institute of Electrical and Electronics Engineers (IEEE) (2019)

    Google Scholar 

  15. Dong, X., Bao, J., Chen, D., et al.: Cswin transformer: A general vision transformer backbone with cross-shaped windows. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12124–12134. Institute of Electrical and Electronics Engineers (IEEE) (2022)

    Google Scholar 

  16. Wei, Y., Zhang, K., Ji, S.: Simultaneous road surface and centerline extraction from large-scale remote sensing images using CNN-based segmentation and tracing 58(12), 8919–8931 (2020)

    Google Scholar 

  17. Ding, L., Bruzzone, L.: DiResNet: Direction-aware residual network for road extraction in VHR remote sensing images 59(12), 10243–10254 (2020)

    Google Scholar 

  18. Waldner, F., Diakogiannis, F.I.: Deep learning on edge: extracting field boundaries from satellite images with a convolutional neural network 245, 111741 (2020)

    Google Scholar 

  19. Diakogiannis, F.I., Waldner, F., Caccetta, P., et al.: ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data 162, 94–114 (2020)

    Google Scholar 

  20. Qin, J., Lang, D., Gao, C.: Feature extraction of time series data based on CNN-CBAM. In: Yu, Z., et al. (eds.) Data Science. ICPCSEE 2023. CCIS, vol. 1879. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-5968-6_17

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiaji Qin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cao, X., Qin, J. (2024). CCU-NET: CBAM and Cascaded Edge Detection Optimization U-NET for Remote Sensing Image Segmentation. In: Xu, C., et al. Data Science. ICPCSEE 2024. Communications in Computer and Information Science, vol 2215. Springer, Singapore. https://doi.org/10.1007/978-981-97-8749-4_12

Download citation

Publish with us

Policies and ethics