Abstract
In the process of cancer diagnosis and treatment, accurate extraction of the lesion area helps the doctor to judge the condition. Currently, medical image segmentation algorithms based on UNet have been verified to be able to play an important role in clinical diagnosis. However, these methods still have the following drawbacks in extracting the region of interest (ROI): (1) ignoring the intra-class variability of medical images. (2) Failure to obtain effective feature redundancy. To address these problems, a U-shaped medical image segmentation network based on a Mixed depthwise convolution residual module (MDRM), called MD-UNet, is proposed in this paper. In MD-UNet, the MDRM built with a Mixed depthwise convolution attention block (MDAB) captures both local and global dependencies in the image to mitigate the effects of intra-class differences. MDAB captures valid redundant features and further captures global features of the input data. At the same time, the lightweight MDAB senses changes in the receptive field and generates multiple feature mappings. Compared with UNeXt on the ISIC2018 dataset, the MD-UNet segmentation accuracy Dice and IoU are improved by 1.33% and 1.91%, respectively. The code is available at https://github.com/Cloud-Liu/MD-UNet.
Graphical abstract
Similar content being viewed by others
Data availability
The ISIC2018 dataset is available on the official website https://challenge.isic-archive.com/data. The MoNuSeg dataset is available on the website https://github.com/tiangexiang/BiO-Net.
References
Codella NC et al (2018) Skin lesion analysis toward melanoma detection: a challenge at the 2017 international symposium on biomedical imaging (isbi), hosted by the international skin imaging collaboration (isic). In: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018). IEEE, pp 168–172
Kumar N, Verma R, Sharma S, Bhargava S, Vahadane A, Sethi A (2017) A dataset and a technique for generalized nuclear segmentation for computational pathology. IEEE Trans Med Imaging 36(7):1550–1560
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer, pp 234–241
Mehta S, Mercan E, Bartlett J, Weaver D, Elmore JG, Shapiro L (2018) Y-Net: joint segmentation and classification for diagnosis of breast biopsy images. Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16–20, 2018, Proceedings, Part II 11. Springer, pp 893–901
Milletari F, Navab N, Ahmadi SA (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: 2016 fourth international conference on 3D vision (3DV). IEEE, pp 565–571
Chen J et al (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306
Vaswani A et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30:5998–6008
Yu T, Li X, Cai Y, Sun M, Li P (2022) S2-mlp: Spatial-shift mlp architecture for vision. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp 297–306
Lian D, Yu Z, Sun X, Gao S (2021) As-mlp: An axial shifted mlp architecture for vision. arXiv preprint arXiv:2107.08391
Zheng H, He P, Chen W, Zhou M (2022) Mixing and shifting: exploiting global and local dependencies in vision mlps. arXiv preprint arXiv:2202.06510
Touvron H et al (2022) Resmlp: Feedforward networks for image classification with data-efficient training. IEEE Trans Pattern Anal Mach Intell 45(4):5314–5321
Lv J et al (2022) CM-MLP: cascade multi-scale MLP with axial context relation encoder for edge segmentation of medical image. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1100–1107
Howard AG et al (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580–1589
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. International conference on machine learning. PMLR, pp 6105–6114
Li P, Wang L, Luo Y (2020) Ghost-UNet: an efficient method for wound surface segmentation. Basic Clin Pharmacol Toxicol 127:288
Wei G, Zhang Z, Lan C, Lu Y, Chen Z (2023) Active token mixer. In: Proceedings of the AAAI Conference on Artificial Intelligence 37(3):2759–2767
Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2019) Unet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans Med Imaging 39(6):1856–1867
Huang H et al (2020) Unet3+: A full-scale connected Unet for medical image segmentation. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1055–1059
Lian S, Luo Z, Zhong Z, Lin X, Su S, Li S (2018) Attention guided u-net for accurate iris segmentation. J Vis Commun Image Represent 56:296–304
Xiang T, Zhang C, Liu D, Song Y, Huang H, Cai W (2020) BiO-Net: learning recurrent bi-directional connections for encoder-decoder architecture. Medical Image Computing and Computer Assisted Intervention–MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part I 23. Springer, pp 74–84
Ruan J, Xiang S, Xie M, Liu T, Fu Y (2022) MALUNet: a multi-attention and light-weight UNet for skin lesion segmentation. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1150–1156
Dosovitskiy A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
Liu Z et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 10012–10022
Hatamizadeh A et al (2022) Unetr: Transformers for 3d medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 574–584
Wang H, Cao P, Wang J, Zaiane, OR (2022) Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer. In: Proceedings of the AAAI conference on artificial intelligence 36(3):2441–2449
Rajagopal A, Nirmala V (2021) Convolutional gated MLP: combining convolutions & gMLP. arXiv preprint arXiv:2111.03940
Li J, Hassani A, Walton S, Shi H (2023) Convmlp: Hierarchical convolutional mlps for vision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6306–6315
Hou Q, Jiang Z, Yuan L, Cheng MM, Yan S, Feng J (2022) Vision permutator: a permutable mlp-like architecture for visual recognition. IEEE Trans Pattern Anal Mach Intell 45(1):1328–1334
Valanarasu JMJ, Patel VM (2022) Unext: Mlp-based rapid medical image segmentation network. Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022, Proceedings, Part V. Cham: Springer Nature Switzerland, pp 23–33
Liu R, Li Y, Tao L, Liang D, Zheng HT (2022) Are we ready for a new paradigm shift? a survey on visual deep mlp. Patterns 3(7):100520
Guo S et al (2023) Causal knowledge fusion for 3D cross-modality cardiac image segmentation. Inf Fusion 99:101864
Zhuang S, Li F, Raj ANJ, Ding W, Zhou W, Zhuang Z (2021) Automatic segmentation for ultrasound image of carotid intimal-media based on improved superpixel generation algorithm and fractal theory. Comput Methods Programs Biomed 205:106084
Zhou Z, Qi L, Yang X, Ni D, Shi Y (2022) Generalizable cross-modality medical image segmentation via style augmentation and dual normalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 20856–20865
Zhang J et al (2021) Interactive medical image segmentation via a point-based interaction. Artif Intell Med 111:101998
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning. pmlr, pp 448–456
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Funding
This work is supported by the Introduction and Cultivation Program for Young Innovative Talents of Universities in Shandong(2021QCYY003).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, Y., Yao, S., Wang, X. et al. MD-UNet: a medical image segmentation network based on mixed depthwise convolution. Med Biol Eng Comput 62, 1201–1212 (2024). https://doi.org/10.1007/s11517-023-03005-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-023-03005-8