Analysis of Potential Biases on Mammography Datasets for Deep Learning Model Development

Zufiria, Blanca; López-Linares, Karen; García, María J.; Rebescher, Kristin M.; Lalaguna, Iván; Albertín, Esther; Cimadevila, Maria B.; Garcia, Javier; Ledesma-Carbayo, Maria J.; Macía, Iván

doi:10.1007/978-3-031-17721-7_7

Blanca Zufiria^10,11,
Karen López-Linares^10,12,
María J. García¹⁰,
Kristin M. Rebescher¹⁰,
Iván Lalaguna¹³,
Esther Albertín¹³,
Maria B. Cimadevila¹⁴,
Javier Garcia¹⁴,
Maria J. Ledesma-Carbayo¹¹ &
…
Iván Macía^10,12

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13540))

Included in the following conference series:

International Workshop on Applications of Medical AI

557 Accesses
1 Citations

Abstract

The development of democratized, generalizable deep learning applications for health care systems is challenging as potential biases could easily emerge. This paper provides an overview of the potential biases that appear in image analysis datasets that affect the development and performance of artificial intelligence algorithms. Especially, an exhaustive analysis of mammography data has been carried out at the patient, image and source of origin levels. Furthermore, we summarize some techniques to alleviate these biases for the development of fair deep learning models. We present a learning task to classify negative and positive screening mammographies and analyze the influence of biases in the performance of the algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Hammer, G.P., du Prel, J.B., Blettner, M.: Avoiding bias in observational studies: part 8 in a series of articles on evaluation of scientific publications. Dtsch Arztebl Int. 106, 664 (2009)
Google Scholar
Yu, A.C., Eng, J.: One algorithm may not fit all: how selection bias affects machine learning performance. Radiographics 40, 1932–1937 (2020)
Article Google Scholar
Varoquaux, G., Cheplygina, V.: How I failed machine learning in medical imaging - shortcomings and recommendations. Electr. Eng. Syst. Sci. (2021)
Google Scholar
Tong, S., Kagal, L.: Investigating bias in image classification using model explanations. Comput. Sci. Comput. Vis. Pattern Recogn. (2020)
Google Scholar
Oakden-Rayner, L., Dunnmon, J., Carneiro, G., Re, C.: Hidden stratification causes clinically meaningful failures in machine learning for medical imaging. Comput. Sci. Mach. Learn. (2019)
Google Scholar
K. Winkler, et al.: Association between surgical skin markings in dermoscopic images and diagnostic performance of a deep learning convolutional neural network for melanoma recognition. JAMA Dermatol (2019)
Google Scholar
Pot, M., Kieusseyan, N., Prainsack, B.: Not all biases are bad: equitable and inequitable biases in machine learning and radiology. Insights Imaging 12, 1–10 (2021)
Google Scholar
Larrazabal, A.J., Nieto, N., Peterson, V., Milone, D.H., Ferrante, E. Gender imbalance in medical imaging datasets produces biased classifiers for computer-aided diagnosis. In: Proc. Natl. Acad. Sci. USA , 117, 12592–12594 (2020)
Google Scholar
Park, S.H., Han, K.: Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction. Radiology 286, 800–809 (2003)
Article Google Scholar
Zhao, Q., Adeli, E., Pohl, K.M.: Training confounder-free deep learning models for medical applications. Nat. Commun. 11, 1–9 (2020)
Article Google Scholar
Li, X., Cui, Z., Wu, Y., Gu, L., Harada, T.: Stimating and improving fairness with adversarial learning. Comput. Sci. Comput. Vis. Pattern Recogn. (2021)
Google Scholar
Seyyed-Kalantari, L., Zhang, H., McDermott, M., Chen, I.Y., Ghassemi, M.: Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nat. Med. 27, 2176–2182 (2021)
Article Google Scholar
Catala, O.D.T., et al.: Bias analysis on public x-ray image datasets of pneumonia and Covid-19 patients. IEEE Access. 9, 42370–42383 (2021)
Article Google Scholar
E. H. P. Pooch, P. L. Ballester, R. C. Barros: Can we trust deep learning based diagnosis? the impact of domain shift in chest radiograph classification. Electr. Eng. Syst. Sci. Image Video Process. (2020)
Google Scholar
Zech, J.R., Badgeley, M.A., Liu, M., Costa, A.B., Titano, J.J., Oermann, E.K.: Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study. PLoS Med. 15, e1002683 (2018)
Article Google Scholar
Yala, A., et al.: Toward robust mammography-based models for breast cancer risk. Sci Transl. Med. 13, eaba4373 (2021)
Article Google Scholar
Mayer McKinney, S., et al.: International evaluation of an AI system for breast cancer screening. Nature 577, 89–94 (2020)
Article Google Scholar
Wu, N., et al.: Deep neural networks improve radiologists’ performance in breast cancer screening. IEEE Trans. Med. Imaging 39, 1184–1194 (2020)
Article Google Scholar
Ganin, Y., et al.: Domain-adversarial training of neural networks. Stat. Mach. Learn. 17, 2096–2130 (2016)
MathSciNet Google Scholar
Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Visual explanations from deep networks via gradient-based localization. In: IEEE International Conference on Computer Vision (ICCV). (2017)
Google Scholar

Download references

Acknowledgments

This work has been partially funded by FEDER “Una manera de hacer Europa”. This research has been done within the project CADIA - Sistema de Detecciónn de Diversas Patologíss Basado en el Analisis de Imagen con Inteligencia Artificial (DG-SER1-19-003) under the Codigo100 Public Procurement and Innovation Programme by the Galician Health Service - Servizo Galego de Saude (SERGAS) cofunded by the European Regional Development Fund (ERDF).

Author information

Authors and Affiliations

Vicomtech, Basque Research and Technology Alliance, San Sebastián, Spain
Blanca Zufiria, Karen López-Linares, María J. García, Kristin M. Rebescher & Iván Macía
Universidad Politécnica de Madrid, Madrid, Spain
Blanca Zufiria & Maria J. Ledesma-Carbayo
Biodonostia Health Research Institute, San Sebastián, Spain
Karen López-Linares & Iván Macía
Instrumentación y Componentes SA, Inycom, Zaragoza, Spain
Iván Lalaguna & Esther Albertín
Servicio Gallego de Salud, Galicia, Spain
Maria B. Cimadevila & Javier Garcia

Authors

Blanca Zufiria
View author publications
You can also search for this author in PubMed Google Scholar
Karen López-Linares
View author publications
You can also search for this author in PubMed Google Scholar
María J. García
View author publications
You can also search for this author in PubMed Google Scholar
Kristin M. Rebescher
View author publications
You can also search for this author in PubMed Google Scholar
Iván Lalaguna
View author publications
You can also search for this author in PubMed Google Scholar
Esther Albertín
View author publications
You can also search for this author in PubMed Google Scholar
Maria B. Cimadevila
View author publications
You can also search for this author in PubMed Google Scholar
Javier Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Maria J. Ledesma-Carbayo
View author publications
You can also search for this author in PubMed Google Scholar
Iván Macía
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Blanca Zufiria .

Editor information

Editors and Affiliations

University of Pittsburgh, Pittsburgh, PA, USA
Shandong Wu
National Institute of Biomedical Imaging and Bioengineering, Bethesda, MD, USA
Behrouz Shabestari
Stanford University, Stanford, CA, USA
Lei Xing

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zufiria, B. et al. (2022). Analysis of Potential Biases on Mammography Datasets for Deep Learning Model Development. In: Wu, S., Shabestari, B., Xing, L. (eds) Applications of Medical Artificial Intelligence. AMAI 2022. Lecture Notes in Computer Science, vol 13540. Springer, Cham. https://doi.org/10.1007/978-3-031-17721-7_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-17721-7_7
Published: 30 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-17720-0
Online ISBN: 978-3-031-17721-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Analysis of Potential Biases on Mammography Datasets for Deep Learning Model Development