Abstract
A variety of convolutional neural network (CNN) based methods for medical image segmentation have achieved outstanding performance, however, inherently suffered from a limited ability to capture long-range dependencies. In addition, some of the features integrated by complex strategy may be used repeatedly, resulting in more redundant information. To solve this problem, we propose a novel feature integration network by conducting hierarchical transformer under both explicit label and boundary guidance for accurate medical image segmentation. First, the encoder hierarchically extracts multiscale features by each hybrid attention module consisting of residual and Transformer blocks. Second, the parallel features from adjacent layers are integrated via cross fusion block to complement semantics to low-level features. Then, both deep boundary and label supervision are deployed for layer-wise decoders. Finally, the output features are fused by each layer instead of a top-to-down way to integrate multiscale semantic representation for prediction. The proposed method was evaluated on Synapse dataset with Dice of 81.63% and Promise12 dataset with Dice of 92.47%, 95HD of 2.06mm, and aRVD of 4.09%. Extensive experiments demonstrate that our proposed method achieves promising performance on multi-organ CT and prostate MR image segmentation.
Similar content being viewed by others
Data availability
The raw data can be shared if the researchers need to do research on relevant topic and cite our paper.
References
Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, Wang M (2022) Swin-unet: unet-like pure transformer for medical image segmentation. In: European conference on computer vision. pp 205–218
Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S (2020) End-to-end object detection with transformers. In: European conference on computer vision. pp 213–229
Chen J, Wang X, Guo Z, Zhang X, Sun J (2021) Dynamic region-aware convolution. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 8060–8069
Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, Lu L, Yuille A, Zhou Y (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306. pp 1–13
Chen L, Bentley P, Mori K, Misawa K, Fujiwara M, Rueckert D (2018) DRINet for medical image segmentation. IEEE Trans Med Imaging 37(11):2453–2462
Çiçek Ö, Abdulkadir A, Lienkamp S S, Brox T, Ronneberger O (2016) 3D U-net: learning dense volumetric segmentation from sparse annotation. In: Medical image computing and computer assisted intervention–MICCAI 2016. pp 424–432
Dosovitskiy A, Beyer L, Kolesnikov A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929. pp 1–21
Fang X, Yan P (2020) Multi-organ segmentation over partially labeled datasets with multi-scale feature abstraction. IEEE Trans Med Imaging 39(11):3619–3629
Gao Y, Zhou M, Metaxas DN (2021) UTNet: a hybrid transformer architecture for medical image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2021. pp 61–71
Nooshin G et al (2019) Automatic segmentation of prostate MRI using convolutional neural networks: investigating the impact of network architecture on the accuracy of volume measurement and MRI-ultrasound registration. Med Image Anal 58:101558
Gridach M, Voiculescu I (2021) Dopnet: densely oriented pooling network for medical image segmentation. In: 2021 IEEE 18th international symposium on biomedical imaging (ISBI). pp 1714–1717
Gu Z, Cheng J, Fu H et al (2019) CE-net: context encoder network for 2d medical image segmentation. IEEE Trans Med Imaging 38(10):2281–2292
Ran G et al (2020) CA-net: comprehensive attention convolutional neural networks for explainable medical image segmentation. IEEE Trans Med Imaging 40(2):699–711
Guo X, Liu J, Yuan Y (2022) Semantic-oriented labeled-to-unlabeled distribution translation for image segmentation. IEEE Trans Med Imaging 2(2022):41
Hatamizadeh A, Tang Y, Nath V et al (2022) UNETR: transformers for 3d medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp 1748–1758
Huang H, Lin L, Tong R et al. (2020) UNet 3+: a full-scale connected unet for medical image segmentation. In: ICASSP 2020-2020 IEEE international conference on acoustics, speech and signal processing. pp 1055–1059
Fabian I et al (2021) nnU-net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods 18(2):203–211
Jia H, Xia Y, Cai W, Fulham M, Feng DD (2017) Prostate segmentation in MR images using ensemble deep convolutional neural networks. In: 2017 IEEE 14th international symposium on biomedical imaging (ISBI 2017). pp 762–765
Landman B, Xu Z, Igelsias J, Styner M, Langerak T, Klein A (2015) MICCAI multi-atlas labeling beyond the cranial vault–workshop and challenge. In: MICCAI multi-atlas labeling beyond cranial vault—workshop challenge, vol 5. p 12
Lin A, Chen B, Xu J, Zhang Z, Lu G, Zhang D (2022) Ds-transunet: dual swin transformer u-net for medical image segmentation. IEEE Trans Instrum Meas 71:1–15
Geert L et al (2014) Evaluation of prostate segmentation algorithms for MRI: the PROMISE12 challenge. Med Image Anal 18(2):359–373
Litjens G et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88
Liu Z, Lin Y, Cao Y et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 9992–10002
Meyer A, Chlebus G, Rak M et al (2021) Anisotropic 3D multi-stream CNN for accurate prostate segmentation from multi-planar MRI. Comput Methods Prog Biomed 200:105821
Milletari F, Navab N, Ahmadi SA (2016) V-net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 fourth international conference on 3D vision (3DV). pp 565–571
Oktay O, Schlemper J, Folgoc L L et al (2018) Attention u-net: learning where to look for the pancreas. arXiv preprint arXiv:1804.03999. pp 1–10
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: Medical image computing and computer-assisted intervention–MICCAI 2015. pp 234–241
Tong T et al (2015) Discriminative dictionary learning for abdominal multi-organ segmentation. Med Image Anal 23(1):92–104
Valanarasu JMJ, Oza P, Hacihaliloglu I, Patel VM (2021) Medical transformer: gated axial-attention for medical image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2021. pp 36–46
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Adv Neural Inf Proces Syst 30:6000–6010
Wang W, Chen C, Ding M, Yu H, Zha S, Li J (2021) Transbts: multimodal brain tumor segmentation using transformer. In: Medical image computing and computer assisted intervention–MICCAI 2021. pp 109–119
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). pp 7794–7803
Wang H, Cao P, Wang J, Zaiane O R (2022) Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer. In: Proceedings of the AAAI conference on artificial intelligence, vol 36, no 3. pp 2441–2449
Bo W et al (2019) Deeply supervised 3D fully convolutional networks with group dilated convolution for automatic MRI prostate segmentation. Med Phys 46(4):1707–1718
Wu W, Qian C, Yang S, Wang Q, Cai Y, Zhou Q (2018) Look at boundary: a boundary-aware face alignment algorithm. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 2129–2138
Xie Y, Zhang J, Shen C, Xia Y (2021) Cotr: efficiently bridging cnn and transformer for 3d medical image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2021. pp 171–180
Biting Y et al (2019) Ea-GANs: edge-aware generative adversarial networks for cross-modality MR image synthesis. IEEE Trans Med Imaging 38(7):1750–1762
Zhang Q, Yang YB (2021) Rest: an efficient transformer for visual recognition. Adv Neural Inf Proces Syst 34:15475–15485
Zhang Y, Liu H, Hu Q (2021) Transfuse: fusing transformers and cnns for medical image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2021. pp 14–24
Zhang S, Fu H, Yan Y et al (2019) Attention guided network for retinal image segmentation. In: Medical image computing and computer assisted intervention–MICCAI 2019. pp 797–805
Zhao JX, Liu JJ, Fan DP, Cao Y, Yang J, Cheng MM (2019) EGNet: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 8778–8787
Zhao Y, Li H, Wan S, Sekuboyina A, Hu X, Tetteh G, Piraud M, Menze B (2019) Knowledge-aided convolutional neural network for small organ segmentation. IEEE J Biomed Health Inform 23(4):1363–1373
Zhou Z, Rahman Siddiquee MM, Tajbakhsh N, Liang J (2018) Unet++: a nested u-net architecture for medical image segmentation. In: Deep learning in medical image analysis and multimodal learning for clinical decision support, ML-CDS 2018. pp 3–11
Zhu Q, Du B, Yan P (2020) Boundary-weighted domain adaptive neural network for prostate MR image segmentation. IEEE Trans Med Imaging 39(3):753–763
Funding
This work was supported by the National Natural Science Foundation of China (No. 62041108); the Natural Science Foundation of Ningxia (No. 2020AAC03029); Innovation and Entrepreneurship Project for Returnees in Ningxia 2020.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
No conflict of interest exits in the submission of this manuscript and the manuscript is approved by all authors for publication.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, F., Wang, B. Boundary-guided feature integration network with hierarchical transformer for medical image segmentation. Multimed Tools Appl 83, 8955–8969 (2024). https://doi.org/10.1007/s11042-023-15948-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15948-z