Skip to main content

Panoptic Segmentation of Mammograms with Text-to-Image Diffusion Model

  • Conference paper
  • First Online:
Deep Generative Models (DGM4MICCAI 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 15224))

Included in the following conference series:

Abstract

Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remained limited. With the contemporary advances in deep learning their performance may be improved. Recently, vision-language diffusion models emerged, demonstrating outstanding performance in image generation and transferability to various downstream tasks. We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. Specifically, we propose leveraging pretrained features from a Stable Diffusion model as inputs to a state-of-the-art panoptic segmentation architecture, resulting in accurate delineation of individual breast lesions. To bridge the gap between natural and medical imaging domains, we incorporated a mammography-specific MAM-E diffusion model and BiomedCLIP image and text encoders into this framework. We evaluated our approach on two recently published mammography datasets, CDD-CESM and VinDr-Mammo. For the instance segmentation task, we noted 40.25 AP0.1 and 46.82 AP0.05, as well as 25.44 PQ0.1 and 26.92 PQ0.05. For the semantic segmentation task, we achieved Dice scores of 38.86 and 40.92, respectively.

K. Zhao and J. Prokop—These authors contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/NVlabs/ODISE.

References

  1. A Abo-El-Rejal, SE Ayman, and F Aymen. “Advances in breast cancer segmentation: A comprehensive review”. In: Acadlore Transactions on AI and Machine Learning 3.2 (2024), pp. 70-83

    Google Scholar 

  2. Luqman Ahmed et al. “Images data practices for semantic segmentation of breast cancer using deep neural network”. In: Journal of Ambient Intelligence and Humanized Computing 14.11 (2023), pp. 15227-15243

    Google Scholar 

  3. Hafiz Muhammd Ali Bhatti et al. “Multi-detection and segmentation of breast lesions based on mask rcnn-fpn”. In: 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE. 2020, pp. 2698- 2704

    Google Scholar 

  4. Yuelong Chuang, Shiqing Zhang, and Xiaoming Zhao. “Deep learningbased panoptic segmentation: Recent advances and perspectives”. In: IET Image Processing 17.10 (2023), pp. 2807-2828

    Google Scholar 

  5. Prafulla Dhariwal and Alexander Nichol. “Diffusion models beat gans on image synthesis”. In: Advances in neural information processing systems 34 (2021), pp. 8780-8794

    Google Scholar 

  6. Zicheng Guo et al. “A review of the current state of the computer-aided diagnosis (CAD) systems for breast cancer diagnosis”. In: Open Life Sciences 17.1 (2022), pp. 1600-1611

    Google Scholar 

  7. Mark D Halling-Brown et al. “Optimam mammography image database: a large-scale resource of mammography images and clinical data”. In: Radiology: Artificial Intelligence 3.1 (2020), e200103

    Google Scholar 

  8. Nada M Hassan, Safwat Hamad, and Khaled Mahar. “Mammogram breast cancer CAD systems for mass detection and classification: a review”. In: Multimedia Tools and Applications 81.14 (2022), pp. 20043-20075

    Google Scholar 

  9. Jonathan Ho, Ajay Jain, and Pieter Abbeel. “Denoising diffusion probabilistic models”. In: Advances in neural information processing systems 33 (2020), pp. 6840-6851

    Google Scholar 

  10. Md Shamim Hossain. “Microc alcification segmentation using modified unet segmentation network from mammogram images”. In: Journal of King Saud University-Computer and Information Sciences 34.2 (2022), pp. 86- 94

    Google Scholar 

  11. Joana Palés Huix et al. “Are Natural Domain Foundation Models Useful for Medical Image Classification?” In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024, pp. 7634-7643

    Google Scholar 

  12. Rana Khaled et al. “Categorized contrast enhanced mammography dataset for diagnostic and artificial intelligence research”. In: Scientific Data 9.1 (2022), p. 122

    Google Scholar 

  13. Beomyoung Kim, Joonsang Yu, and Sung Ju Hwang. “ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning”. In: arXiv preprint arXiv:2403.20126 (2024)

  14. Alexander Kirillov et al. “Panoptic segmentation”. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019, pp. 9404-9413

    Google Scholar 

  15. Xin Yu Liew, Nazia Hameed, and Jeremie Clos. “A review of computeraided expert systems for breast cancer diagnosis”. In: Cancers 13.11 (2021), p. 2764

    Google Scholar 

  16. Ze Liu et al. “Swin transformer: Hierarchical vision transformer using shifted windows”. In: Proceedings of the IEEE/CVF international conference on computer vision. 2021, pp. 10012-10022

    Google Scholar 

  17. Magnus Løberg et al. “Benefits and harms of mammography screening”. In: Breast cancer research 17 (2015), pp. 1-12

    Article  Google Scholar 

  18. Kosmia Loizidou, Rafaella Elia, and Costas Pitris. “Computer-aided breast cancer detection and classification in mammography: A comprehensive review”. In: Computers in Biology and Medicine 153 (2023), p. 106554

    Google Scholar 

  19. Ricardo Montoya-del-Angel et al. “MAM-E: Mammographic synthetic image generation with diffusion models”. In: Sensors 24.7 (2024), p. 2076

    Google Scholar 

  20. Hieu T Nguyen et al. “VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography”. In: Scientific Data 10.1 (2023), p. 277

    Google Scholar 

  21. American College of Radiology. ACR BI-RADS ATLAS - Mammography. Reporting System. https://www.acr.org/-/media/ACR/Files/RADS/BI-RADS/Mammography-Reporting.pdf. [Accessed 19-03-2024]. 2013

  22. Robin Rombach et al. “High-resolution image synthesis with latent diffusion models”. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022, pp. 10684-10695

    Google Scholar 

  23. Hama Soltani et al. “Breast cancer lesion detection and segmentation based on mask R-CNN”. In: 2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI). IEEE. 2021, pp. 1-6.

    Google Scholar 

  24. Hyuna Sung et al. “Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries”. In: CA: a cancer journal for clinicians 71.3 (2021), pp. 209-249

    Google Scholar 

  25. Yuxin Wu et al. Detectron2. https://github.com/facebookresearch/detectron2. 2019

  26. Jiarui Xu et al. “Open-vocabulary panoptic segmentation with text-toimage diffusion models”. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023, pp. 2955-2966

    Google Scholar 

  27. Yutong Yan et al. “Two-stage multi-scale mass segmentation from full mammograms”. In: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI). IEEE. 2021, pp. 1628-1631

    Google Scholar 

  28. Lukas Zbinden et al. “Stochastic segmentation with conditional categorical diffusion models”. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023, pp. 1119-1129

    Google Scholar 

  29. Sheng Zhang et al. Large-Scale Domain-Specific Pretraining for Biomedical Vision-Language Processing. 2023

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jakub Prokop .

Editor information

Editors and Affiliations

Ethics declarations

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 479 KB)

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhao, K., Prokop, J., Montalt-Tordera, J., Mohammadi, S. (2025). Panoptic Segmentation of Mammograms with Text-to-Image Diffusion Model. In: Mukhopadhyay, A., Oksuz, I., Engelhardt, S., Mehrof, D., Yuan, Y. (eds) Deep Generative Models. DGM4MICCAI 2024. Lecture Notes in Computer Science, vol 15224. Springer, Cham. https://doi.org/10.1007/978-3-031-72744-3_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-72744-3_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-72743-6

  • Online ISBN: 978-3-031-72744-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics