Abstract
U-Net has been widely applied in semantic segmentation tasks, but it faces challenges in the semantic segmentation of high-resolution remote sensing images due to the loss of boundary information during the downsampling process and the inherent blurriness of object boundaries in remote sensing images. We propose an advanced U-Net variant model that addresses these issues. By introducing the CBAM attention mechanism, we enhance the extraction of boundary information during the downsampling process, and by incorporating a cascaded edge detection module, we significantly improve the model's boundary segmentation performance. As a result, the model demonstrates excellent performance in the segmentation of high-resolution remote sensing images. The results indicate that our proposed model outperforms other baseline models and exhibits superior performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Boulila, W., Farah, I.R., Ettabaa, K.S., et al.: Spatio-temporal modeling for knowledge discovery in satellite image databases, pp. 35–49 (2010)
Boulila, W.: A top-down approach for semantic segmentation of big remote sensing images 12(3), 295–306 (2019)
Zhou, Z., Zhang, J., Gong, C.: Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network 38(17), 2491–2510 (2023)
Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation 39(12), 2481–2495 (2017)
Zhang, S., Zhang, C.: Modified U-Net for plant diseased leaf image segmentation 204, 107511 (2023)
Abdollahi, A., Pradhan, B., Alamri, A.M.: An ensemble architecture of deep convolutional Segnet and Unet networks for building semantic segmentation from high-resolution aerial images 37(12), 3355–3370 (2022)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440. Institute of Electrical and Electronics Engineers (IEEE) (2015)
Zhang, D., Zhang, L., Tang, J.: Augmented FCN: rethinking context modeling for semantic segmentation 66(4), 142105 (2023)
Fifty, C., Amid, E., Zhao, Z., et al.: Efficiently identifying task groupings for multi task learning 34, 27503–27516 (2021)
Samant, R.M., Bachute, M.R., Gite, S., et al.: Framework for deep learning-based language models using multi task learning in natural language understanding: a systematic literature review and future directions 10, 17078–17097 (2022)
Chen, J., Zhu, D., Shen, X., et al.: Minigpt-v2: large language model as a unified interface for vision-language multi task learning 2310, 09478 (2023)
Cheng, G., Wang, Y., Xu, S., et al.: Automatic road detection and centerline extraction via cascaded end-to-end convolutional neural network 55(6), 3322–3337 (2017)
Murugesan, B., Sarveswaran, K., Shankaranarayana, S.M., et al.: Conv-MCD: a plug-and-play multi task module for medical image segmentation. In: Machine Learning in Medical Imaging: 10th International Workshop, pp. 292–300. Springer International (2019)
Murugesan, B., Sarveswaran, K., Shankaranarayana, S.M., et al.: Psi-Net: shape and boundary aware joint multi task deep network for medical image segmentation. In: 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 7223–7226. Institute of Electrical and Electronics Engineers (IEEE) (2019)
Dong, X., Bao, J., Chen, D., et al.: Cswin transformer: A general vision transformer backbone with cross-shaped windows. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12124–12134. Institute of Electrical and Electronics Engineers (IEEE) (2022)
Wei, Y., Zhang, K., Ji, S.: Simultaneous road surface and centerline extraction from large-scale remote sensing images using CNN-based segmentation and tracing 58(12), 8919–8931 (2020)
Ding, L., Bruzzone, L.: DiResNet: Direction-aware residual network for road extraction in VHR remote sensing images 59(12), 10243–10254 (2020)
Waldner, F., Diakogiannis, F.I.: Deep learning on edge: extracting field boundaries from satellite images with a convolutional neural network 245, 111741 (2020)
Diakogiannis, F.I., Waldner, F., Caccetta, P., et al.: ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data 162, 94–114 (2020)
Qin, J., Lang, D., Gao, C.: Feature extraction of time series data based on CNN-CBAM. In: Yu, Z., et al. (eds.) Data Science. ICPCSEE 2023. CCIS, vol. 1879. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-5968-6_17
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Cao, X., Qin, J. (2024). CCU-NET: CBAM and Cascaded Edge Detection Optimization U-NET for Remote Sensing Image Segmentation. In: Xu, C., et al. Data Science. ICPCSEE 2024. Communications in Computer and Information Science, vol 2215. Springer, Singapore. https://doi.org/10.1007/978-981-97-8749-4_12
Download citation
DOI: https://doi.org/10.1007/978-981-97-8749-4_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-8748-7
Online ISBN: 978-981-97-8749-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)