Abstract
The primary impediments in lung CT image segmentation stem from the ambiguity in edge definition and the inadequate segmentation accuracy. Addressing these issues, this paper introduces a novel composite lung CT image segmentation framework that integrates an attention mechanism with an edge detection operator. We utilize residual dynamic convolutions as the encoder to augment the network's capability for extracting and representing nuanced lesion features. Sobel edge detection is integrated into the skip connections to facilitate the transmission and utilization of edge information. In particular, we introduce an information fusion attention module for deeper layers, optimizing feature reorganization and utilization by attention mechanisms and dilated convolution. Experimental evaluations on two lung CT datasets reveal that our proposed AE-UNet achieves outstanding segmentation performance, surpassing the best baseline network by an average of 0.93%.








Similar content being viewed by others
Data availability
No datasets were generated or analyzed during the current study.
References
Feng D, Haase-Schütz C, Rosenbaum L et al (2020) Deep multi-modal object detection and semantic segmentation for autonomous driving: datasets, methods, and challenges. IEEE Trans Intell Transp Syst 22(3):1341–1360
Yuan X, Shi J, Gu L (2021) A review of deep learning methods for semantic segmentation of remote sensing imagery. Expert Syst Appl 169:114417
Otsu N (1975) A threshold selection method from gray-level histograms. Automatica 11(285–296):23–27
Kanopoulos N, Vasanthavada N, Baker RL (1988) Design of an image edge detection filter using the Sobel operator. IEEE J Solid-State Circuits 23(2):358–367
Deng CX, Wang GB, Yang XR (2013) Image edge detection algorithm based on improved canny operator. In: 2013 International Conference on Wavelet Analysis and Pattern Recognition. IEEE, pp 168–172
Nock R, Nielsen F (2004) Statistical region merging. IEEE Trans Pattern Anal Mach Intell 26(11):1452–1458
Boykov Y, Kolmogorov V (2004) An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans Pattern Anal Mach Intell 26(9):1124–1137
Elnakib A, Gimel’farb G, Suri JS, et al. (2011) Medical image segmentation: a brief survey. Multi Modality State-of-the-Art Medical Image Segmentation and Registration Methodologies: Volume II, pp 1–39
Ramesh KKD, Kumar GK, Swapna K et al (2021) A review of medical image segmentation algorithms. EAI Endorsed Trans Pervasive Health Technol 7(27):e6–e6
Patil DD, Deore SG (2013) Medical image segmentation: a review. Int J Comput Sci Mob Comput 2(1):22–27
Minaee S, Boykov Y, Porikli F et al (2021) Image segmentation using deep learning: a survey. IEEE Trans Pattern Anal Mach Intell 44(7):3523–3542
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3431–3440
Chen LC, Papandreou G, Kokkinos I, et al. (2014) Semantic image segmentation with deep convolutional nets and fully connected crfs. arxiv preprint https://arxiv.org/abs/1412.7062
Chen LC, Papandreou G, Kokkinos I et al (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848
Chen LC, Papandreou G, Schroff F, et al. (2017) Rethinking atrous convolution for semantic image segmentation. arxiv preprint https://arxiv.org/abs/1706.05587
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, part III 18. Springer, pp 234–241
Zhou Z, Siddiquee MMR, Tajbakhsh N et al (2019) Unet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans Med Imaging 39(6):1856–1867
Huang H, Lin L, Tong R, et al. (2020) Unet 3+: A full-scale connected unet for medical image segmentation. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1055–1059
Çiçek Ö, Abdulkadir A, Lienkamp SS, et al. (2016) 3D U-Net: learning dense volumetric segmentation from sparse annotation. In: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016: 19th International Conference, Athens, Greece, October 17-21, 2016, Proceedings, Part II 19. Springer, pp 424–432
Nyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. preprint https://arxiv.org/abs/1409.1556
He K, Zhang X, Ren S, et al. (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 770–778
Oktay O, Schlemper J, Folgoc LL et al. (2018) Attention u-net: Learning where to look for the pancreas. arxiv preprint https://arxiv.org/abs/1804.03999
Jiang Z, Dong J et al (2020) Attention gate resU-Net for automatic MRI brain tumor entation. IEEE Access 8:58533–58545
Dhaubhadel PM, Lee JK, Tian Q (2024) Attention-aware DAE for automated solar coronal loop segmentation
Li S, Dong M, Du G et al (2019) Attention dense-u-net for automatic breast mass segmentation in digital mammogram. Ieee Access 7:59037–59047
Iandola F, Moskewicz M, Karayev S et al. (2014) Densenet: Implementing efficient convnet descriptor pyramids. arxiv preprint https://arxiv.org/abs/1404.1869
Liu J, Kuang H et al (2022) A fully automated multimodal MRI-based multi-task learning for a segmentation and IDH genotyping. IEEE Trans Med Imaging 41(6):1532
Wang H, Chen Z et al (2022) TransUNet+: redesigning the skip connection to enhance features in cal image segmentation. Knowl-Based Syst 256:109859
Cao P, Wang J, et al. Uctransnet: rethinking the skip connections in u-net from a channel-perspective with transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence 36(3): 2441–2449
Dharampal VM (2015) Methods of image edge detection: a review. J Electr Electron Syst 4(2):2332–2796
Zunair H, Hamza AB (2021) Sharp U-Net: depthwise convolutional network for biomedical image segmentation. Comput Biol Med 136:104699
Song H, Wang Y, Zeng S et al (2023) OAU-net: outlined Attention U-net for biomedical image segmentation. Biomed Signal Process Control 79:104038
Lin Y, Zhang D, Fang X, et al. (2023) Rethinking boundary detection in deep learning models for medical image segmentation. In: International Conference on Information Processing in Medical Imaging. Springer, Cham, pp 730–742
Xu M, Ma Q, Zhang H et al (2024) MEF-UNet: an end-to-end ultrasound image segmentation algorithm based on multi-scale feature extraction and fusion. Comput Med Imaging Graph 114:102370
Milletari F, Navab N, Ahmadi SA (2016) V-net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV). IEEE, pp 565–571
Xiao X, Lian S, Luo Z, et al. (2018) Weighted res-unet for high-quality retina vessel segmentation. In: 2018 9th International Conference on Information Technology in Medicine and Education (ITME). IEEE, pp 327–331
Ibtehaz N, Rahman MS (2020) MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw 121:74–87
Vaswani A, Shazeer N, Parmar N, et al. (2017) Attention is all you need. Adv Neural Inf Process Syst 30
Chen J, Lu Y, Yu Q, et al. (2021) Transunet: transformers make strong encoders for medical image segmentation. arxiv preprint https://arxiv.org/abs/2102.04306
Liu Z, Lin Y, Cao Y, et al. (2021) Swin transformer: Hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 10012–10022
Cao H, Wang Y, Chen J, et al. (2022) Swin-unet: Unet-like pure transformer for medical image segmentation. In: European Conference on Computer Vision. Springer, Cham, pp 205–218
Wenxuan W, Chen C, Meng D et al. (2021) Transbts: multimodal brain tumor segmentation using transformer. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, pp 109–119
Qiu Z, Xu T, Langerman J et al (2021) A deep learning approach for segmentation, classification, and visualization of 3-D high-frequency ultrasound images of mouse embryos. IEEE Trans Ultrason Ferroelectr Freq Control 68(7):2460–2471
Kuo J, Qiu Z, Aristizabal O, et al. (2018) Automatic body localization and brain ventricle segmentation in 3D high frequency ultrasound images of mouse embryos. In: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018). IEEE, pp 635–639
Qiu Z, Langerman J, Nair N, et al. (2018) Deep bv: A fully automated system for brain ventricle localization and segmentation in 3d ultrasound images of embryonic mice. In: 2018 IEEE Signal Processing in Medicine and Biology Symposium (SPMB). IEEE, pp 1–6
Xu T, Qiu Z, Das W, et al. (2020) Deep mouse: an end-to-end auto-context refinement framework for brain ventricle & body segmentation in embryonic mice ultrasound volumes. In: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI). IEEE, pp 122–126
Qiu Z, Nair N, Langerman J, et al. (2019) Automatic mouse embryo brain ventricle & body segmentation and mutant classification from ultrasound data using deep learning. In: 2019 IEEE International Ultrasonics Symposium (IUS). IEEE, pp 12–15
Chen Y, Dai X, Liu M, et al. (2020) Dynamic convolution: Attention over convolution kernels. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 11030–11039
Wang Q, Wu B, Zhu P, et al. (2020) ECA-Net: Efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 11534–11542
Zhu X, Cheng D, Zhang Z, et al. (2019) An empirical study of spatial attention mechanisms in deep networks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 6688–6697
Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. arxiv preprint https://arxiv.org/abs/1511.07122
Mader KS (2017) Finding and measuring lungs in CT data, 2017, https://www.kaggle.com/kmader/finding-lungs-in-ct-data/data/
Jun M et al. (2020) COVID-19 CT lung and infection segmentation dataset. Zenodo 20
Chen Y, Wang K, Liao X et al (2019) Channel-Unet: a spatial channel-wise convolutional neural network for liver and tumors segmentation. Front Genet 10:1110
Yin Y, Han Z, Jian M et al (2023) AMSUnet: a neural network using atrous multi-scale convolution for medical image segmentation. Comput Biol Med 162:107120
Tang F, Ding J, Quan Q, et al. (2024) Cmunext: an efficient medical image segmentation network based on large kernel and skip fusion. In: 2024 IEEE International Symposium on Biomedical Imaging (ISBI). IEEE, pp 1–5
Acknowledgements
This work is supported by the Chengdu University Pattern Recognition and Intelligent Information Processing Sichuan University Key Laboratory open fund (MSSB-2024-13), the Foundation of Guangdong Provincial Key Laboratory of Sensor Technology and Biomedical Instrument (2020B1212060077), Chengdu University Tianfu Culture Digital Innovation Key Laboratory of Sichuan Culture and Tourism Department open project (TFWH-2024-5).
Author information
Authors and Affiliations
Contributions
H.L. and J.W. wrote the main manuscript text, and Z.R. prepared figures. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, H., Ren, Z., Zhu, G. et al. AE-UNet: a composite lung CT image segmentation framework using attention mechanism and edge detection. J Supercomput 81, 331 (2025). https://doi.org/10.1007/s11227-024-06874-4
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-024-06874-4