Abstract
Convolutional neural networks (CNN), while effective in medical diagnostics, have shown concerning biases against underrepresented patient groups. In this study, we provide an in-depth exploration of these biases in the realm of image-based Alzheimer’s disease (AD) diagnosis using state-of-the-art CNNs, building upon and extending prior investigations. We dissect performance-based and calibration-based biases across patient subgroups differentiated by sex, ethnicity, age, educational qualifications, and APOE4 status. Our findings reveal substantial disparities in model performance and calibration, underscoring the challenges intersectional identities impose. Such biases highlight the importance of fairness analysis in fostering equitable AI applications in the AD domain. Appropriate mitigation actions should be carried out to ensure that, those who need it, receive healthcare attention independently from the subgroup they belong to.
Data used in preparation of this article was obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (http://www.adni-info.org/). The investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in analysis or writing of this report.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alzheimer’s disease facts and figures. Alzheimers Dement 18(4), 700–789 (2022)
Wen, J., et al.: Convolutional neural networks for classification of Alzheimer’s disease: overview and reproducible evaluation. Med. Image Anal. 63, 101694 (2020)
Illakiya, T., Karthik, R.: Automatic detection of Alzheimer’s disease using deep learning models and neuro-imaging: current trends and future perspectives. Neuroinformatics 21, 339–364 (2023)
Seyyed-Kalantari, L., Liu, G., McDermott, M.B., Ghassemi, M.: CheXclusion: fairness gaps in deep chest X-ray classifiers. In: Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, vol. 26, pp. 232–243 (2020)
Ricci, L.M.A., Echeveste, R., Ferrante, E.: Addressing fairness in artificial intelligence for medical imaging. Nat. Commun. 13, 4581 (2022)
Guo, C., Pleiss, G., Sun, Y., Weinberger, K.Q.: On calibration of modern neural networks. In: Proceedings of the 34th International Conference on Machine Learning, pp. 1321–1330 (2017)
Bae, B., et al.: Identification of Alzheimer’s disease using a convolutional neural network model based on T1-weighted magnetic resonance imaging. Sci. Rep. 10, 1–10 (2020)
El-Sappagh, S., et al.: Trustworthy artificial intelligence in Alzheimer’s disease: state of the art, opportunities, and challenges. Artif. Intell. Rev. (2023)
Petersen, E., et al.: Feature robustness and sex differences in medical imaging: a case study in MRI-based Alzheimer’s disease detection. In: MICCAI 2022: 25th International Conference, pp. 88–98 (2022)
Mendelson, A.F., Zuluaga, M.A., Lorenzi, M., Hutton, B.F.: Selection bias in the reported performances of AD classification pipelines. NeuroImage Clin. 14, 400–416 (2017)
Jack, C.R., et al.: The Alzheimer’s disease neuroimaging initiative (ADNI): MRI methods. J. Magn. Resonan. Imaging 27(4), 685–691 (2008)
Tustison, N.J., et al.: N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010)
Avants, B.B., Epstein, C.L., Grossman, M., Gee, J.C.: Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal. 12(1), 26–41 (2008)
Fonov, V., Dadar, M.: The PREVENT-AD research group. In: Louis Collins, D. (ed.) Deep Learning of Quality Control for Stereotaxic Registration of Human Brain MRI. bioRxiv (2018)
Chen, I.Y., Johansson, F.D., Sontag, D.: Why is my classifier discriminatory? In: Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS’18), pp. 3543–3554. Curran Associates Inc., Red Hook (2018)
Hardt, M., Price, E., Srebro, N.: Equality of opportunity in supervised learning. In: Proceedings of the 30th International Conference on Neural Information Processing Systems, pp. 3315–3323
Gruber, S.G., Buettner, F.: Better uncertainty calibration via proper scores for classification and beyond. In: Oh, A.H., Agarwal, A., Belgrave, D., Cho, K. (eds.) NeurIPS 2022: Advances in Neural Information Processing Systems (2022)
Roelofs, R., Cain, N., Shlens, J., Mozer, M.C.: Mitigating bias in calibration error estimation. In: Camps-Valls, G., Ruiz, F.J.R., and Valera, I. (eds.) AISTATS 2022: The 25th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research), vol. 151, pp. 4036–4054. PMLR (2022)
Murphy, A.H.: A new vector partition of the probability score. J. Appl. Meteorol. 12(4), 595–600 (1973)
Knopman, D.S., et al.: Age and neurodegeneration imaging biomarkers in persons with Alzheimer disease dementia. Neurology 87(7), 691–698 (2016)
Dukart, J., Schroeter, M.L., Mueller, K.: Alzheimer’s disease neuroimaging initiative: age correction in dementia-matching to a healthy brain. PLoS ONE 6(7), e22193 (2011)
ten Kate, M., et al.: Impact of APOE-\(\varepsilon \)4 and family history of dementia on gray matter atrophy in cognitively healthy middle-aged adults. Neurobiol. Aging 38, 14–20 (2016)
Cacciaglia, R., et al.: Effects of APOE-\(\varepsilon \)4 allele load on brain morphology in a cohort of middle-aged healthy individuals with enriched genetic risk for Alzheimer’s disease. Alzheimers Dement. 14(7), 902–912 (2018)
Stern, Y.: Cognitive reserve. Neuropsychologia 47(10), 2015–2028 (2009)
Buolamwini, J., Gebru, T.: Gender shades: intersectional accuracy disparities in commercial gender classification. In: Proceedings of Machine Learning Research, vol. 81, pp. 77–91 (2018)
Seyyed-Kalantari, L., et al.: Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nat. Med. 27, 2176–2182 (2021)
Acknowledgements
VND, JHG, and KL received funding from the EU’s Horizon 2020 research and innovation programme under Grant Agreement No 848158, EarlyCause, and MPAS under Grant Agreement No 952103, EuCanImage. AC received funding from Ministry of Universities and Recovery, Transformation and Resilience Plan, through UPC (Grant No 2021UPC-MS-67573). JHG is a Serra Húnter fellow.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Dang, V.N. et al. (2023). Auditing Unfair Biases in CNN-Based Diagnosis of Alzheimer’s Disease. In: Wesarg, S., et al. Clinical Image-Based Procedures, Fairness of AI in Medical Imaging, and Ethical and Philosophical Issues in Medical Imaging. CLIP EPIMI FAIMI 2023 2023 2023. Lecture Notes in Computer Science, vol 14242. Springer, Cham. https://doi.org/10.1007/978-3-031-45249-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-031-45249-9_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45248-2
Online ISBN: 978-3-031-45249-9
eBook Packages: Computer ScienceComputer Science (R0)