Abstract
Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remained limited. With the contemporary advances in deep learning their performance may be improved. Recently, vision-language diffusion models emerged, demonstrating outstanding performance in image generation and transferability to various downstream tasks. We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. Specifically, we propose leveraging pretrained features from a Stable Diffusion model as inputs to a state-of-the-art panoptic segmentation architecture, resulting in accurate delineation of individual breast lesions. To bridge the gap between natural and medical imaging domains, we incorporated a mammography-specific MAM-E diffusion model and BiomedCLIP image and text encoders into this framework. We evaluated our approach on two recently published mammography datasets, CDD-CESM and VinDr-Mammo. For the instance segmentation task, we noted 40.25 AP0.1 and 46.82 AP0.05, as well as 25.44 PQ0.1 and 26.92 PQ0.05. For the semantic segmentation task, we achieved Dice scores of 38.86 and 40.92, respectively.
K. Zhao and J. Prokop—These authors contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
A Abo-El-Rejal, SE Ayman, and F Aymen. “Advances in breast cancer segmentation: A comprehensive review”. In: Acadlore Transactions on AI and Machine Learning 3.2 (2024), pp. 70-83
Luqman Ahmed et al. “Images data practices for semantic segmentation of breast cancer using deep neural network”. In: Journal of Ambient Intelligence and Humanized Computing 14.11 (2023), pp. 15227-15243
Hafiz Muhammd Ali Bhatti et al. “Multi-detection and segmentation of breast lesions based on mask rcnn-fpn”. In: 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE. 2020, pp. 2698- 2704
Yuelong Chuang, Shiqing Zhang, and Xiaoming Zhao. “Deep learningbased panoptic segmentation: Recent advances and perspectives”. In: IET Image Processing 17.10 (2023), pp. 2807-2828
Prafulla Dhariwal and Alexander Nichol. “Diffusion models beat gans on image synthesis”. In: Advances in neural information processing systems 34 (2021), pp. 8780-8794
Zicheng Guo et al. “A review of the current state of the computer-aided diagnosis (CAD) systems for breast cancer diagnosis”. In: Open Life Sciences 17.1 (2022), pp. 1600-1611
Mark D Halling-Brown et al. “Optimam mammography image database: a large-scale resource of mammography images and clinical data”. In: Radiology: Artificial Intelligence 3.1 (2020), e200103
Nada M Hassan, Safwat Hamad, and Khaled Mahar. “Mammogram breast cancer CAD systems for mass detection and classification: a review”. In: Multimedia Tools and Applications 81.14 (2022), pp. 20043-20075
Jonathan Ho, Ajay Jain, and Pieter Abbeel. “Denoising diffusion probabilistic models”. In: Advances in neural information processing systems 33 (2020), pp. 6840-6851
Md Shamim Hossain. “Microc alcification segmentation using modified unet segmentation network from mammogram images”. In: Journal of King Saud University-Computer and Information Sciences 34.2 (2022), pp. 86- 94
Joana Palés Huix et al. “Are Natural Domain Foundation Models Useful for Medical Image Classification?” In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024, pp. 7634-7643
Rana Khaled et al. “Categorized contrast enhanced mammography dataset for diagnostic and artificial intelligence research”. In: Scientific Data 9.1 (2022), p. 122
Beomyoung Kim, Joonsang Yu, and Sung Ju Hwang. “ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning”. In: arXiv preprint arXiv:2403.20126 (2024)
Alexander Kirillov et al. “Panoptic segmentation”. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019, pp. 9404-9413
Xin Yu Liew, Nazia Hameed, and Jeremie Clos. “A review of computeraided expert systems for breast cancer diagnosis”. In: Cancers 13.11 (2021), p. 2764
Ze Liu et al. “Swin transformer: Hierarchical vision transformer using shifted windows”. In: Proceedings of the IEEE/CVF international conference on computer vision. 2021, pp. 10012-10022
Magnus Løberg et al. “Benefits and harms of mammography screening”. In: Breast cancer research 17 (2015), pp. 1-12
Kosmia Loizidou, Rafaella Elia, and Costas Pitris. “Computer-aided breast cancer detection and classification in mammography: A comprehensive review”. In: Computers in Biology and Medicine 153 (2023), p. 106554
Ricardo Montoya-del-Angel et al. “MAM-E: Mammographic synthetic image generation with diffusion models”. In: Sensors 24.7 (2024), p. 2076
Hieu T Nguyen et al. “VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography”. In: Scientific Data 10.1 (2023), p. 277
American College of Radiology. ACR BI-RADS ATLAS - Mammography. Reporting System. https://www.acr.org/-/media/ACR/Files/RADS/BI-RADS/Mammography-Reporting.pdf. [Accessed 19-03-2024]. 2013
Robin Rombach et al. “High-resolution image synthesis with latent diffusion models”. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022, pp. 10684-10695
Hama Soltani et al. “Breast cancer lesion detection and segmentation based on mask R-CNN”. In: 2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI). IEEE. 2021, pp. 1-6.
Hyuna Sung et al. “Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries”. In: CA: a cancer journal for clinicians 71.3 (2021), pp. 209-249
Yuxin Wu et al. Detectron2. https://github.com/facebookresearch/detectron2. 2019
Jiarui Xu et al. “Open-vocabulary panoptic segmentation with text-toimage diffusion models”. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023, pp. 2955-2966
Yutong Yan et al. “Two-stage multi-scale mass segmentation from full mammograms”. In: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI). IEEE. 2021, pp. 1628-1631
Lukas Zbinden et al. “Stochastic segmentation with conditional categorical diffusion models”. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023, pp. 1119-1129
Sheng Zhang et al. Large-Scale Domain-Specific Pretraining for Biomedical Vision-Language Processing. 2023
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Disclosure of Interests
The authors have no competing interests to declare that are relevant to the content of this article.
1 Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, K., Prokop, J., Montalt-Tordera, J., Mohammadi, S. (2025). Panoptic Segmentation of Mammograms with Text-to-Image Diffusion Model. In: Mukhopadhyay, A., Oksuz, I., Engelhardt, S., Mehrof, D., Yuan, Y. (eds) Deep Generative Models. DGM4MICCAI 2024. Lecture Notes in Computer Science, vol 15224. Springer, Cham. https://doi.org/10.1007/978-3-031-72744-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-72744-3_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72743-6
Online ISBN: 978-3-031-72744-3
eBook Packages: Computer ScienceComputer Science (R0)