AE-UNet: a composite lung CT image segmentation framework using attention mechanism and edge detection

Li, Hongzhi; Ren, Zhanghao; Zhu, Guoqing; Wang, Jiaxi

doi:10.1007/s11227-024-06874-4

Published: 24 December 2024

Volume 81, article number 331 (2025)

The Journal of Supercomputing

Abstract

Lung CT image segmentation is hampered chiefly by ambiguous edge definition and inadequate segmentation accuracy. To address these issues, this paper introduces a composite lung CT image segmentation framework that integrates an attention mechanism with an edge detection operator. Residual dynamic convolutions serve as the encoder, strengthening the network's ability to extract and represent fine-grained lesion features. Sobel edge detection is integrated into the skip connections so that edge information is transmitted to, and used by, the decoder. For the deeper layers, we introduce an information fusion attention module that improves feature reorganization and utilization through attention mechanisms and dilated convolution. Experimental evaluations on two lung CT datasets show that the proposed AE-UNet achieves strong segmentation performance, surpassing the best baseline network by an average of 0.93%.
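To make the edge-detection idea concrete, the following is a minimal sketch of the standard Sobel operator applied to a single 2D map, followed by one possible way of adding the resulting edge map back onto a skip-connection feature. It is not the authors' implementation: the abstract does not say how the Sobel response is fused, so the additive fusion, the zero padding at the borders, and the names `sobel_magnitude`, `add_edge_to_skip`, and `weight` are all assumptions made for illustration.

```python
import math

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1],
           [0, 0, 0],
           [1, 2, 1]]


def sobel_magnitude(image):
    """Return the Sobel gradient magnitude of a 2D list of numbers.

    Borders use zero padding (an assumption of this sketch), so the
    output has the same shape as the input.
    """
    height, width = len(image), len(image[0])

    def pixel(r, c):
        # Values outside the image are treated as zero.
        if 0 <= r < height and 0 <= c < width:
            return image[r][c]
        return 0.0

    edges = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            gx = gy = 0.0
            for dr in range(3):
                for dc in range(3):
                    value = pixel(r + dr - 1, c + dc - 1)
                    gx += SOBEL_X[dr][dc] * value
                    gy += SOBEL_Y[dr][dc] * value
            edges[r][c] = math.hypot(gx, gy)
    return edges


def add_edge_to_skip(feature_map, weight=1.0):
    """Hypothetical fusion: skip feature plus a weighted Sobel edge map."""
    edges = sobel_magnitude(feature_map)
    return [[f + weight * e for f, e in zip(f_row, e_row)]
            for f_row, e_row in zip(feature_map, edges)]


# Toy 5x5 "slice": dark on the left, bright on the right.
toy = [[0, 0, 1, 1, 1] for _ in range(5)]
edge_map = sobel_magnitude(toy)
fused = add_edge_to_skip(toy)
```

On the toy input, the response is large along the vertical boundary between the dark and bright columns and zero in the flat interior of the bright region. In a trained network the same fixed kernels would typically be applied per channel to the encoder feature maps rather than to raw pixels.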

Data availability

No datasets were generated or analyzed during the current study.

Acknowledgements

This work is supported by the Chengdu University Pattern Recognition and Intelligent Information Processing Sichuan University Key Laboratory open fund (MSSB-2024-13), the Foundation of Guangdong Provincial Key Laboratory of Sensor Technology and Biomedical Instrument (2020B1212060077), and the Chengdu University Tianfu Culture Digital Innovation Key Laboratory of Sichuan Culture and Tourism Department open project (TFWH-2024-5).

Author information

Authors and Affiliations

School of Computer Science, Chengdu University, Chengdu, China
Hongzhi Li, Zhanghao Ren, Guoqing Zhu & Jiaxi Wang
Key Laboratory of Pattern Recognition and Intelligent Information Processing, Institutions of Higher Education of Sichuan Province, Chengdu University, Chengdu, China
Jiaxi Wang
Key Laboratory of Digital Innovation of Tianfu Culture, Sichuan Provincial Department of Culture and Tourism, Chengdu University, Chengdu, China
Jiaxi Wang

Contributions

H.L. and J.W. wrote the main manuscript text, and Z.R. prepared figures. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jiaxi Wang.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Li, H., Ren, Z., Zhu, G. et al. AE-UNet: a composite lung CT image segmentation framework using attention mechanism and edge detection. J Supercomput 81, 331 (2025). https://doi.org/10.1007/s11227-024-06874-4

Accepted: 19 December 2024
Published: 24 December 2024
DOI: https://doi.org/10.1007/s11227-024-06874-4
