
An Unpaired Cross-Modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea

  • Conference paper

Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (BrainLes 2022)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14092)

Abstract

The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task to multi-institutional scans. In this work, we propose an unpaired cross-modality segmentation framework using data augmentation and hybrid convolutional networks. To handle the heterogeneous intensity distributions and varying image sizes of multi-institutional scans, we apply min-max normalization to scale the intensities of all scans to the range [-1, 1], and use voxel-size resampling and center cropping to obtain fixed-size sub-volumes for training. We adopt two data augmentation strategies for effectively learning semantic information and generating realistic target-domain scans: generative and online data augmentation. For generative data augmentation, we use CUT and CycleGAN to generate two groups of realistic T2 volumes with different details and appearances for supervised segmentation training. For online data augmentation, we design a random tumor-signal reduction method to simulate the heterogeneity of VS tumor signals. Furthermore, we employ an advanced hybrid convolutional network with multi-dimensional convolutions to adaptively learn sparse inter-slice information and dense intra-slice information for accurate volumetric segmentation of the VS tumor and cochlea regions in anisotropic scans. On the crossMoDA 2022 validation dataset, our method produces promising results, achieving mean DSC values of 72.47% and 76.48% and mean ASSD values of 3.42 mm and 0.53 mm for the VS tumor and cochlea regions, respectively.
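The preprocessing and online augmentation steps described in the abstract are simple to prototype. Below is a minimal NumPy sketch, assuming raw MRI volumes stored as float arrays; the function names, the attenuation range, and the crop size are illustrative assumptions, not the authors' actual implementation (which is not included on this page).

```python
import numpy as np

def minmax_normalize(volume: np.ndarray) -> np.ndarray:
    # Scale intensities into [-1, 1], as described in the abstract.
    vmin, vmax = float(volume.min()), float(volume.max())
    return 2.0 * (volume - vmin) / (vmax - vmin) - 1.0

def center_crop(volume: np.ndarray, target_shape) -> np.ndarray:
    # Extract a fixed-size sub-volume around the volume center
    # (voxel-size resampling would be applied beforehand).
    starts = [(s - t) // 2 for s, t in zip(volume.shape, target_shape)]
    slices = tuple(slice(st, st + t) for st, t in zip(starts, target_shape))
    return volume[slices]

def reduce_tumor_signal(volume, tumor_mask, rng, low=0.5, high=1.0):
    # Online augmentation: randomly attenuate raw intensities inside the
    # tumor mask to mimic heterogeneous VS tumor signals. The [low, high]
    # range is a hypothetical choice; applied before normalization.
    out = volume.copy()
    out[tumor_mask > 0] *= rng.uniform(low, high)
    return out

# Toy usage on a synthetic volume (shapes chosen for illustration only).
rng = np.random.default_rng(0)
vol = rng.normal(loc=100.0, scale=20.0, size=(40, 256, 256)).astype(np.float32)
mask = np.zeros(vol.shape, dtype=np.uint8)
mask[18:22, 100:140, 100:140] = 1
sub = center_crop(minmax_normalize(reduce_tumor_signal(vol, mask, rng)),
                  (32, 192, 192))
```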



Author information

Corresponding author: Hong Liu.


Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper


Cite this paper

Zhuang, Y., Liu, H., Song, E., Cetinkaya, C., Hung, CC. (2023). An Unpaired Cross-Modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea. In: Bakas, S., et al. Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. BrainLes 2022. Lecture Notes in Computer Science, vol 14092. Springer, Cham. https://doi.org/10.1007/978-3-031-44153-0_8


  • DOI: https://doi.org/10.1007/978-3-031-44153-0_8


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-44152-3

  • Online ISBN: 978-3-031-44153-0

  • eBook Packages: Computer Science, Computer Science (R0)
