Skip to main content

Advertisement

Log in

A medical unsupervised domain adaptation framework based on Fourier transform image translation and multi-model ensemble self-training strategy

  • Original Article
  • Published:
International Journal of Computer Assisted Radiology and Surgery Aims and scope Submit manuscript

Abstract

Purpose

Well-established segmentation models will suffer performance degradation when deployed on data with heterogeneous features, especially in the field of medical image analysis. Although researchers have proposed many approaches to address this problem in recent years, most of them are feature-adaptation-based adversarial networks, the problems such as training instability often arise in adversarial training. To ameliorate this challenge and improve the robustness of processing data with different distributions, we propose a novel unsupervised domain adaptation framework for cross-domain medical image segmentation.

Methods

In our proposed approach, Fourier transform guided images translation and multi-model ensemble self-training are integrated into a unified framework. First, after Fourier transform, the amplitude spectrum of source image is replaced with that of target image, and reconstructed by the inverse Fourier transform. Second, we augment target dataset with the synthetic cross-domain images, performing supervised learning using the original source set labels while implementing regularization by entropy minimization on predictions of unlabeled target data. We employ several segmentation networks with different hyperparameters simultaneously, pseudo-labels are generated by averaging their outputs and comparing to confidence threshold, and gradually optimize the quality of pseudo-labels through multiple rounds self-training.

Results

We employed our framework to two liver CT datasets for bidirectional adaptation experiments. In both experiments, compared to the segmentation network without domain alignment, dice similarity coefficient (DSC) increased by nearly 34% and average symmetric surface distance (ASSD) decreased by about 10. The DSC values were also improved by 10.8% and 6.7%, respectively, compared to the existing model.

Conclusion

We propose a Fourier transform-based UDA framework, the experimental results and comparisons demonstrate that the proposed method can effectively diminish the performance degradation caused by domain shift and performs best on the cross-domain segmentation tasks. Our proposed multi-model ensemble training strategy can also improve the robustness of the segmentation system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Rekka M, Nawres K, Henda N, Saoussen HZ (2021) A bilinear convolutional neural network for lung nodules classification on CT images. Int J CARS 16:91–101. https://doi.org/10.1007/s11548-020-02283-z

    Article  Google Scholar 

  2. NiroomandFam B, Nikravanshalmani A, Khalilian M (2021) Automatic breast mass detection in mammograms using density of wavelet coefficients and a patch-based CNN. Int J CARS 16:1805–1815. https://doi.org/10.1007/s11548-021-02443-9

    Article  Google Scholar 

  3. Tan T, Wang Z, Du H, Xu J, Qiu B (2021) Lightweight pyramid network with spatial attention mechanism for accurate retinal vessel segmentation. Int J CARS 16:673–682. https://doi.org/10.1007/s11548-021-02344-x

    Article  Google Scholar 

  4. Matthew P, Neumann M, Iyyer M, Gardner M, Zettlemoyer L (2018) Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. pp 2227–2237. New Orleans, Louisiana. Association for Computational Linguistics. https://aclanthology.org/N18-1202

  5. Yang YC, Soatto S (2020) FDA: Fourier domain adaptation for semantic segmentation. In: 2020 IEEE conference on computer vision and pattern recognition (CVPR). pp 4085–4095. https://doi.org/10.1109/CVPR42600.2020.00414

  6. Yang YC, Lao D, Sundaramoorthi G, Soatto S (2020) Phase consistent ecological domain adaptation. In: 2020 IEEE conference on computer vision and pattern recognition (CVPR). pp 9008–9017.https://doi.org/10.1109/CVPR42600.2020.00903

  7. Piotrowski LN, Campbell FW (1982) A demonstration of the visual importance and flexibility of spatial-frequency amplitude and phase. Perception 11(3):337–346

    Article  CAS  PubMed  Google Scholar 

  8. Oppenheim AV, Lim JS (1981) The importance of phase in signals. Proc IEEE 69(5):529–541

    Article  Google Scholar 

  9. Hansen BC, Hess RF (2007) Structural sparseness and spatial phase alignment in natural scenes. J Opt Soc Am A 24(7):1873–1885. https://doi.org/10.1364/JOSAA.24.001873

    Article  Google Scholar 

  10. Tarvainen A, Valpola H (2017) Mean teachers are better role models: weight-averaged consistency targets improve semi-supervised deep learning results. In: 2020 conference and workshop on neural information processing systems (NeurIPS). https://doi.org/10.48550/arXiv.1703.01780

  11. Laine S, Aila T (2017) Temporal Ensembling for semi-supervised learning. In: Proceedings of the 5th international conference on learning representations (ICLR). https://doi.org/10.48550/arXiv.1610.02242

  12. Bilic P, Christ PF, Vorontsov E, Chlebus G, Chen H, Dou Q, Fu CW, Han X, Heng PA, Hesser J, Kadoury S, Kopczyski T, Le M, Li CM, Li XM, Lipkova J, Lowengrub J, Meine H, Moltz JH, Pal C, Piraud M, Qi XJ, Qi J, Rempfler M, Roth K, Schenk A, Sekuboyina A, Zhou P, Hulsemeyer C, Beetz M, Ettlinger F, Gruen F, Kaissis G, Lohfer F, Braren R, Holch J, Hofmann F, Sommer W, Heinemann V, Jacobs C, Mamani GEH, Ginneken B, Chartrand G, Tang A, Drozdzal M, Kadoury S, Ben-Cohen A, Klang E, Amitai M, Konen E, Greenspan H, Moreau J, Hostettler A, Soler L, Vivanti R, Szeskin A, Cohain Naama, Sosna J, Joskowicz L, Kumar A, Kore A, Wang CL, Feng DG, Li F, Krishnamurthi G, He J, Wu JR, Kim J, Zhou JY, Ma J, Li JB, Maninis KK, Kaluva CK, Bi L, Khened M, Beliver M, Lin QZ, Yang XP, Yuan YD, Chen YN, Li YQ, Qiu YD, Wu YL, Menze B (2019) The liver tumor segmentation benchmark (LiTS). https://arxiv.org/abs/1901.04056v1

  13. Kavur AE, Gezer NS, Baris M, Aslan S, Conze PH, Groza V, Pham D, Chatterjee S, Ernst P, Ozkan S, Baydar B, Lachinov D, Han S, Pauli J, Isensee F, Perkonigg M, Sathish R, Rajan R, Sheet D, Dovletov G, Speck O, Nurnberger A, Hein M, Akar GB, Unal G, Dicle O, Selver MA (2021) CHAOS Challenge - combined (CT-MR) healthy abdominal organ segmentation. Med Image Anal. https://doi.org/10.1016/j.media.2020.101950

    Article  PubMed  Google Scholar 

  14. Ronneberger O, Fischer P, Brox T (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention. In: Proceedings of the 18th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI). pp 234–241.https://doi.org/10.1007/978-3-319-24574-4_28

  15. Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: 2017 IEEE international conference on computer vision (ICCV). pp 2242–2251. https://doi.org/10.1109/ICCV.2017.244

  16. Vu TH, Jain H, Bucher M, Cord M, Perez P (2019). ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation. In: 2019 IEEE conference on computer vision and pattern recognition (CVPR). pp 2517–2526 https://doi.org/10.1109/CVPR.2019.00262

Download references

Acknowledgements

This research work is supported by the grants from National Natural Science Foundation of China (61673007). We sincerely thank reviewers for their good advice.

Funding

This research work is supported by the grants from National Natural Science Foundation of China (61673007).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tao Gong.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Consent to participate

This article does not contain patient data.

Consent for publication

This article does not contain patient data.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jiang, K., Gong, T. & Quan, L. A medical unsupervised domain adaptation framework based on Fourier transform image translation and multi-model ensemble self-training strategy. Int J CARS 18, 1885–1894 (2023). https://doi.org/10.1007/s11548-023-02867-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11548-023-02867-5

Keywords