MD-UNet: a medical image segmentation network based on mixed depthwise convolution

Liu, Yun; Yao, Shuanglong; Wang, Xing; Chen, Ji; Li, Xiaole

doi:10.1007/s11517-023-03005-8

MD-UNet: a medical image segmentation network based on mixed depthwise convolution

Original Article
Published: 30 December 2023

Volume 62, pages 1201–1212, (2024)
Cite this article

Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Yun Liu¹,
Shuanglong Yao¹,
Xing Wang¹,
Ji Chen¹ &
…
Xiaole Li¹

412 Accesses
2 Citations
Explore all metrics

Abstract

In the process of cancer diagnosis and treatment, accurate extraction of the lesion area helps the doctor to judge the condition. Currently, medical image segmentation algorithms based on UNet have been verified to be able to play an important role in clinical diagnosis. However, these methods still have the following drawbacks in extracting the region of interest (ROI): (1) ignoring the intra-class variability of medical images. (2) Failure to obtain effective feature redundancy. To address these problems, a U-shaped medical image segmentation network based on a Mixed depthwise convolution residual module (MDRM), called MD-UNet, is proposed in this paper. In MD-UNet, the MDRM built with a Mixed depthwise convolution attention block (MDAB) captures both local and global dependencies in the image to mitigate the effects of intra-class differences. MDAB captures valid redundant features and further captures global features of the input data. At the same time, the lightweight MDAB senses changes in the receptive field and generates multiple feature mappings. Compared with UNeXt on the ISIC2018 dataset, the MD-UNet segmentation accuracy Dice and IoU are improved by 1.33% and 1.91%, respectively. The code is available at https://github.com/Cloud-Liu/MD-UNet.

Graphical abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer

Article 04 September 2023

A novel medical image segmentation approach by using multi-branch segmentation network based on local and global information synchronous learning

Article Open access 25 April 2023

GLUNet: Global-Local Fusion U-Net for 2D Medical Image Segmentation

Data availability

The ISIC2018 dataset is available on the official website https://challenge.isic-archive.com/data. The MoNuSeg dataset is available on the website https://github.com/tiangexiang/BiO-Net.

References

Codella NC et al (2018) Skin lesion analysis toward melanoma detection: a challenge at the 2017 international symposium on biomedical imaging (isbi), hosted by the international skin imaging collaboration (isic). In: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018). IEEE, pp 168–172
Kumar N, Verma R, Sharma S, Bhargava S, Vahadane A, Sethi A (2017) A dataset and a technique for generalized nuclear segmentation for computational pathology. IEEE Trans Med Imaging 36(7):1550–1560
Article PubMed Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer, pp 234–241
Mehta S, Mercan E, Bartlett J, Weaver D, Elmore JG, Shapiro L (2018) Y-Net: joint segmentation and classification for diagnosis of breast biopsy images. Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16–20, 2018, Proceedings, Part II 11. Springer, pp 893–901
Milletari F, Navab N, Ahmadi SA (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: 2016 fourth international conference on 3D vision (3DV). IEEE, pp 565–571
Chen J et al (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306
Vaswani A et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30:5998–6008
Google Scholar
Yu T, Li X, Cai Y, Sun M, Li P (2022) S2-mlp: Spatial-shift mlp architecture for vision. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp 297–306
Lian D, Yu Z, Sun X, Gao S (2021) As-mlp: An axial shifted mlp architecture for vision. arXiv preprint arXiv:2107.08391
Zheng H, He P, Chen W, Zhou M (2022) Mixing and shifting: exploiting global and local dependencies in vision mlps. arXiv preprint arXiv:2202.06510
Touvron H et al (2022) Resmlp: Feedforward networks for image classification with data-efficient training. IEEE Trans Pattern Anal Mach Intell 45(4):5314–5321
Google Scholar
Lv J et al (2022) CM-MLP: cascade multi-scale MLP with axial context relation encoder for edge segmentation of medical image. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1100–1107
Howard AG et al (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580–1589
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. International conference on machine learning. PMLR, pp 6105–6114
Li P, Wang L, Luo Y (2020) Ghost-UNet: an efficient method for wound surface segmentation. Basic Clin Pharmacol Toxicol 127:288
Google Scholar
Wei G, Zhang Z, Lan C, Lu Y, Chen Z (2023) Active token mixer. In: Proceedings of the AAAI Conference on Artificial Intelligence 37(3):2759–2767
Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2019) Unet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans Med Imaging 39(6):1856–1867
Article PubMed PubMed Central Google Scholar
Huang H et al (2020) Unet3+: A full-scale connected Unet for medical image segmentation. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1055–1059
Lian S, Luo Z, Zhong Z, Lin X, Su S, Li S (2018) Attention guided u-net for accurate iris segmentation. J Vis Commun Image Represent 56:296–304
Article Google Scholar
Xiang T, Zhang C, Liu D, Song Y, Huang H, Cai W (2020) BiO-Net: learning recurrent bi-directional connections for encoder-decoder architecture. Medical Image Computing and Computer Assisted Intervention–MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part I 23. Springer, pp 74–84
Ruan J, Xiang S, Xie M, Liu T, Fu Y (2022) MALUNet: a multi-attention and light-weight UNet for skin lesion segmentation. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1150–1156
Dosovitskiy A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
Liu Z et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 10012–10022
Hatamizadeh A et al (2022) Unetr: Transformers for 3d medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 574–584
Wang H, Cao P, Wang J, Zaiane, OR (2022) Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer. In: Proceedings of the AAAI conference on artificial intelligence 36(3):2441–2449
Rajagopal A, Nirmala V (2021) Convolutional gated MLP: combining convolutions & gMLP. arXiv preprint arXiv:2111.03940
Li J, Hassani A, Walton S, Shi H (2023) Convmlp: Hierarchical convolutional mlps for vision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6306–6315
Hou Q, Jiang Z, Yuan L, Cheng MM, Yan S, Feng J (2022) Vision permutator: a permutable mlp-like architecture for visual recognition. IEEE Trans Pattern Anal Mach Intell 45(1):1328–1334
Article PubMed Google Scholar
Valanarasu JMJ, Patel VM (2022) Unext: Mlp-based rapid medical image segmentation network. Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022, Proceedings, Part V. Cham: Springer Nature Switzerland, pp 23–33
Liu R, Li Y, Tao L, Liang D, Zheng HT (2022) Are we ready for a new paradigm shift? a survey on visual deep mlp. Patterns 3(7):100520
Article PubMed PubMed Central Google Scholar
Guo S et al (2023) Causal knowledge fusion for 3D cross-modality cardiac image segmentation. Inf Fusion 99:101864
Article Google Scholar
Zhuang S, Li F, Raj ANJ, Ding W, Zhou W, Zhuang Z (2021) Automatic segmentation for ultrasound image of carotid intimal-media based on improved superpixel generation algorithm and fractal theory. Comput Methods Programs Biomed 205:106084
Article PubMed Google Scholar
Zhou Z, Qi L, Yang X, Ni D, Shi Y (2022) Generalizable cross-modality medical image segmentation via style augmentation and dual normalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 20856–20865
Zhang J et al (2021) Interactive medical image segmentation via a point-based interaction. Artif Intell Med 111:101998
Article PubMed Google Scholar
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning. pmlr, pp 448–456
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90
Article Google Scholar
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

Download references

Funding

This work is supported by the Introduction and Cultivation Program for Young Innovative Talents of Universities in Shandong(2021QCYY003).

Author information

Authors and Affiliations

School of Information Science and Engineering, Linyi University, Linyi, 276000, China
Yun Liu, Shuanglong Yao, Xing Wang, Ji Chen & Xiaole Li

Authors

Yun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuanglong Yao
View author publications
You can also search for this author in PubMed Google Scholar
Xing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ji Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaole Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xing Wang or Ji Chen.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liu, Y., Yao, S., Wang, X. et al. MD-UNet: a medical image segmentation network based on mixed depthwise convolution. Med Biol Eng Comput 62, 1201–1212 (2024). https://doi.org/10.1007/s11517-023-03005-8

Download citation

Received: 01 August 2023
Accepted: 16 December 2023
Published: 30 December 2023
Issue Date: April 2024
DOI: https://doi.org/10.1007/s11517-023-03005-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MD-UNet: a medical image segmentation network based on mixed depthwise convolution