Panoptic Segmentation of Mammograms with Text-to-Image Diffusion Model

Zhao, Kun; Prokop, Jakub; Montalt-Tordera, Javier; Mohammadi, Sadegh

doi:10.1007/978-3-031-72744-3_10

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 15224)

Included in the following conference series:

MICCAI Workshop on Deep Generative Models

Abstract

Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remain limited. Contemporary advances in deep learning may improve their performance. Recently, vision-language diffusion models have emerged, demonstrating outstanding performance in image generation and transferability to various downstream tasks. We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. Specifically, we propose leveraging pretrained features from a Stable Diffusion model as inputs to a state-of-the-art panoptic segmentation architecture, resulting in accurate delineation of individual breast lesions. To bridge the gap between natural and medical imaging domains, we incorporated a mammography-specific MAM-E diffusion model and BiomedCLIP image and text encoders into this framework. We evaluated our approach on two recently published mammography datasets, CDD-CESM and VinDr-Mammo. For the instance segmentation task, we obtained 40.25 AP_0.1 and 46.82 AP_0.05, as well as 25.44 PQ_0.1 and 26.92 PQ_0.05. For the semantic segmentation task, we achieved Dice scores of 38.86 and 40.92, respectively.
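
The metrics quoted in the abstract can be illustrated with a small sketch. This is not the authors' evaluation code. It assumes that the subscripts in PQ_0.1 and PQ_0.05 denote the IoU threshold above which a predicted lesion counts as matched to a ground-truth lesion, and that PQ follows the standard definition (sum of matched IoUs divided by TP + 0.5 FP + 0.5 FN). Because thresholds below 0.5 no longer guarantee unique matches, the sketch pairs masks greedily by decreasing IoU, which is also an assumption. The helper names (`iou`, `dice`, `panoptic_quality`, `rect`) are hypothetical, and masks are represented as sets of pixel coordinates to keep the example dependency-free.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]
Mask = Set[Pixel]


def iou(a: Mask, b: Mask) -> float:
    """Intersection over union of two pixel sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def dice(pred: Mask, target: Mask) -> float:
    """Dice coefficient: 2 * |A and B| / (|A| + |B|)."""
    total = len(pred) + len(target)
    return 2 * len(pred & target) / total if total else 1.0


def panoptic_quality(preds: List[Mask], targets: List[Mask], thr: float) -> float:
    """Single-class PQ with a configurable IoU matching threshold.

    Assumption: candidate pairs are matched greedily in order of
    decreasing IoU, and a pair counts only if its IoU exceeds `thr`.
    """
    pairs = sorted(
        ((iou(p, t), i, j) for i, p in enumerate(preds) for j, t in enumerate(targets)),
        reverse=True,
    )
    used_preds, used_targets, matched = set(), set(), []
    for score, i, j in pairs:
        if score <= thr:
            break  # all remaining pairs have lower IoU
        if i in used_preds or j in used_targets:
            continue  # each mask may be matched at most once
        used_preds.add(i)
        used_targets.add(j)
        matched.append(score)
    tp = len(matched)
    fp = len(preds) - tp      # unmatched predictions
    fn = len(targets) - tp    # unmatched ground-truth lesions
    denom = tp + 0.5 * fp + 0.5 * fn
    return sum(matched) / denom if denom else 0.0


def rect(r0: int, c0: int, r1: int, c1: int) -> Mask:
    """Axis-aligned rectangular mask covering rows r0..r1-1, cols c0..c1-1."""
    return {(r, c) for r in range(r0, r1) for c in range(c0, c1)}


# Toy case: one ground-truth lesion, one loosely overlapping prediction,
# and one spurious prediction elsewhere in the image.
target = rect(0, 0, 4, 4)
overlapping = rect(2, 2, 6, 6)
spurious = rect(10, 10, 12, 12)

print(dice(overlapping, target))
print(panoptic_quality([overlapping, spurious], [target], thr=0.1))
print(panoptic_quality([overlapping, spurious], [target], thr=0.5))
```

In this toy case the overlapping prediction has an IoU of 4/28 with the target, so it counts as a match at a threshold of 0.1 but not at the conventional 0.5, where PQ drops to zero. The functions return values in [0, 1], whereas the abstract reports scores on a 0 to 100 scale.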

K. Zhao and J. Prokop contributed equally to this work.

Notes

1. https://github.com/NVlabs/ODISE

Author information

Authors and Affiliations

Bayer AG, Leverkusen, Germany
Kun Zhao, Jakub Prokop, Javier Montalt-Tordera & Sadegh Mohammadi

Corresponding author

Correspondence to Jakub Prokop.

Editor information

Editors and Affiliations

TU Darmstadt, Darmstadt, Germany
Anirban Mukhopadhyay
Department of Computer Engineering, Istanbul Technical University, Istanbul, Türkiye
Ilkay Oksuz
Universitätsklinikum Heidelberg, Heidelberg, Germany
Sandy Engelhardt
University of Regensburg, Regensburg, Germany
Dorit Mehrof
Department of Electrical Engineering, City University of Hong Kong, Hong Kong, Hong Kong
Yixuan Yuan

Ethics declarations

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 479 KB)

Cite this paper

Zhao, K., Prokop, J., Montalt-Tordera, J., Mohammadi, S. (2025). Panoptic Segmentation of Mammograms with Text-to-Image Diffusion Model. In: Mukhopadhyay, A., Oksuz, I., Engelhardt, S., Mehrof, D., Yuan, Y. (eds) Deep Generative Models. DGM4MICCAI 2024. Lecture Notes in Computer Science, vol 15224. Springer, Cham. https://doi.org/10.1007/978-3-031-72744-3_10

DOI: https://doi.org/10.1007/978-3-031-72744-3_10
Published: 09 October 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72743-6
Online ISBN: 978-3-031-72744-3
eBook Packages: Computer Science, Computer Science (R0)
