Skip to main content

Advertisement

Log in

MD-UNet: a medical image segmentation network based on mixed depthwise convolution

  • Original Article
  • Published:
Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Abstract

In the process of cancer diagnosis and treatment, accurate extraction of the lesion area helps the doctor to judge the condition. Currently, medical image segmentation algorithms based on UNet have been verified to be able to play an important role in clinical diagnosis. However, these methods still have the following drawbacks in extracting the region of interest (ROI): (1) ignoring the intra-class variability of medical images. (2) Failure to obtain effective feature redundancy. To address these problems, a U-shaped medical image segmentation network based on a Mixed depthwise convolution residual module (MDRM), called MD-UNet, is proposed in this paper. In MD-UNet, the MDRM built with a Mixed depthwise convolution attention block (MDAB) captures both local and global dependencies in the image to mitigate the effects of intra-class differences. MDAB captures valid redundant features and further captures global features of the input data. At the same time, the lightweight MDAB senses changes in the receptive field and generates multiple feature mappings. Compared with UNeXt on the ISIC2018 dataset, the MD-UNet segmentation accuracy Dice and IoU are improved by 1.33% and 1.91%, respectively. The code is available at https://github.com/Cloud-Liu/MD-UNet.

Graphical abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The ISIC2018 dataset is available on the official website https://challenge.isic-archive.com/data. The MoNuSeg dataset is available on the website https://github.com/tiangexiang/BiO-Net.

References

  1. Codella NC et al (2018) Skin lesion analysis toward melanoma detection: a challenge at the 2017 international symposium on biomedical imaging (isbi), hosted by the international skin imaging collaboration (isic). In: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018). IEEE, pp 168–172

  2. Kumar N, Verma R, Sharma S, Bhargava S, Vahadane A, Sethi A (2017) A dataset and a technique for generalized nuclear segmentation for computational pathology. IEEE Trans Med Imaging 36(7):1550–1560

    Article  PubMed  Google Scholar 

  3. Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer, pp 234–241

  4. Mehta S, Mercan E, Bartlett J, Weaver D, Elmore JG, Shapiro L (2018) Y-Net: joint segmentation and classification for diagnosis of breast biopsy images. Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16–20, 2018, Proceedings, Part II 11. Springer, pp 893–901

  5. Milletari F, Navab N, Ahmadi SA (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: 2016 fourth international conference on 3D vision (3DV). IEEE, pp 565–571

  6. Chen J et al (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306

  7. Vaswani A et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30:5998–6008

    Google Scholar 

  8. Yu T, Li X, Cai Y, Sun M, Li P (2022) S2-mlp: Spatial-shift mlp architecture for vision. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp 297–306

  9. Lian D, Yu Z, Sun X, Gao S (2021) As-mlp: An axial shifted mlp architecture for vision. arXiv preprint arXiv:2107.08391

  10. Zheng H, He P, Chen W, Zhou M (2022) Mixing and shifting: exploiting global and local dependencies in vision mlps. arXiv preprint arXiv:2202.06510

  11. Touvron H et al (2022) Resmlp: Feedforward networks for image classification with data-efficient training. IEEE Trans Pattern Anal Mach Intell 45(4):5314–5321

    Google Scholar 

  12. Lv J et al (2022) CM-MLP: cascade multi-scale MLP with axial context relation encoder for edge segmentation of medical image. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1100–1107

  13. Howard AG et al (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861

  14. Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580–1589

  15. Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. International conference on machine learning. PMLR, pp 6105–6114

  16. Li P, Wang L, Luo Y (2020) Ghost-UNet: an efficient method for wound surface segmentation. Basic Clin Pharmacol Toxicol 127:288

    Google Scholar 

  17. Wei G, Zhang Z, Lan C, Lu Y, Chen Z (2023) Active token mixer. In: Proceedings of the AAAI Conference on Artificial Intelligence 37(3):2759–2767

  18. Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2019) Unet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans Med Imaging 39(6):1856–1867

    Article  PubMed  PubMed Central  Google Scholar 

  19. Huang H et al (2020) Unet3+: A full-scale connected Unet for medical image segmentation. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1055–1059

  20. Lian S, Luo Z, Zhong Z, Lin X, Su S, Li S (2018) Attention guided u-net for accurate iris segmentation. J Vis Commun Image Represent 56:296–304

    Article  Google Scholar 

  21. Xiang T, Zhang C, Liu D, Song Y, Huang H, Cai W (2020) BiO-Net: learning recurrent bi-directional connections for encoder-decoder architecture. Medical Image Computing and Computer Assisted Intervention–MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part I 23. Springer, pp 74–84

  22. Ruan J, Xiang S, Xie M, Liu T, Fu Y (2022) MALUNet: a multi-attention and light-weight UNet for skin lesion segmentation. In: 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp 1150–1156

  23. Dosovitskiy A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929

  24. Liu Z et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 10012–10022

  25. Hatamizadeh A et al (2022) Unetr: Transformers for 3d medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 574–584

  26. Wang H, Cao P, Wang J, Zaiane, OR (2022) Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer. In: Proceedings of the AAAI conference on artificial intelligence 36(3):2441–2449

  27. Rajagopal A, Nirmala V (2021) Convolutional gated MLP: combining convolutions & gMLP. arXiv preprint arXiv:2111.03940

  28. Li J, Hassani A, Walton S, Shi H (2023) Convmlp: Hierarchical convolutional mlps for vision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6306–6315

  29. Hou Q, Jiang Z, Yuan L, Cheng MM, Yan S, Feng J (2022) Vision permutator: a permutable mlp-like architecture for visual recognition. IEEE Trans Pattern Anal Mach Intell 45(1):1328–1334

    Article  PubMed  Google Scholar 

  30. Valanarasu JMJ, Patel VM (2022) Unext: Mlp-based rapid medical image segmentation network. Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022, Proceedings, Part V. Cham: Springer Nature Switzerland, pp 23–33

  31. Liu R, Li Y, Tao L, Liang D, Zheng HT (2022) Are we ready for a new paradigm shift? a survey on visual deep mlp. Patterns 3(7):100520

    Article  PubMed  PubMed Central  Google Scholar 

  32. Guo S et al (2023) Causal knowledge fusion for 3D cross-modality cardiac image segmentation. Inf Fusion 99:101864

    Article  Google Scholar 

  33. Zhuang S, Li F, Raj ANJ, Ding W, Zhou W, Zhuang Z (2021) Automatic segmentation for ultrasound image of carotid intimal-media based on improved superpixel generation algorithm and fractal theory. Comput Methods Programs Biomed 205:106084

    Article  PubMed  Google Scholar 

  34. Zhou Z, Qi L, Yang X, Ni D, Shi Y (2022) Generalizable cross-modality medical image segmentation via style augmentation and dual normalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 20856–20865

  35. Zhang J et al (2021) Interactive medical image segmentation via a point-based interaction. Artif Intell Med 111:101998

    Article  PubMed  Google Scholar 

  36. Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning. pmlr, pp 448–456

  37. Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90

    Article  Google Scholar 

  38. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  39. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

Download references

Funding

This work is supported by the Introduction and Cultivation Program for Young Innovative Talents of Universities in Shandong(2021QCYY003).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Xing Wang or Ji Chen.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Yao, S., Wang, X. et al. MD-UNet: a medical image segmentation network based on mixed depthwise convolution. Med Biol Eng Comput 62, 1201–1212 (2024). https://doi.org/10.1007/s11517-023-03005-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11517-023-03005-8

Keywords

Navigation