Abstract
In highly regulated areas such as healthcare there is a demand for explainable and trustworthy systems that are capable of providing some sort of foundation or logical reasoning to their functionality. Therefore, deep learning applications associated with such industry are increasingly required by this sense of accountability regarding their production value. Additionally, it is of utter importance to take advantage of all possible data resources, in order to achieve a greater amount of efficiency respecting such intelligent frameworks, while maintaining a realistic medical scenario. As a way to explore this issue, we propose two models trained with information retained in chest radiographs and regularized by the associated medical reports. We argue that the knowledge extracted from the free-radiology text, in a multimodal training context, promotes more coherence, leading to better decisions and interpretability saliency maps. Our proposed approach demonstrated to be more robust than their baseline counterparts, showing better classification performances, and also ensuring more concise, consistent and less dispersed saliency maps. Our proof-of-concept experiments were done using the publicly available multimodal radiology dataset MIMIC-CXR that contains a myriad of chest X-rays and its correspondent free-text reports.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Baltrušaitis, T., Ahuja, C., Morency, L.P.: Multimodal machine learning: a survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41(2), 423–443 (2018)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2019)
Goldberger, A., et al.: Components of a new research resource for complex physiologic signals. PhysioNet 101 (2000)
Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks (2018)
Johnson, A.E.W., et al.: MIMIC-CXR-JPG: a large publicly available database of labeled chest radiographs. CoRR abs/1901.07042 (2019). http://arxiv.org/abs/1901.07042
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Kokhlikyan, N., et al.: Captum: A unified and generic model interpretability library for PyTorch (2020)
Li, Y., Tian, S., Huang, Y., Dong, W.: Driverless artificial intelligence framework for the identification of malignant pleural effusion. Transl. Oncol. 14(1), 100896 (2021). https://doi.org/10.1016/j.tranon.2020.100896. https://www.sciencedirect.com/science/article/pii/S1936523320303880
Lucieri, A., Dengel, A., Ahmed, S.: Deep learning based decision support for medicine - a case study on skin cancer diagnosis (2021)
Shrikumar, A., Greenside, P., Kundaje, A.: Learning important features through propagating activation differences. In: International Conference on Machine Learning, pp. 3145–3153. PMLR (2017)
Silva, W., Poellinger, A., Cardoso, J.S., Reyes, M.: Interpretability-guided content-based medical image retrieval. In: Martel, A.L., et al. (eds.) MICCAI 2020. LNCS, vol. 12261, pp. 305–314. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59710-8_30
Tjoa, E., Guan, C.: A survey on explainable artificial intelligence (XAI): toward medical XAI. IEEE Trans. Neural Netw. Learn. Syst. 32(11), 4793–4813 (2021)
Yu, Y., Hu, P., Lin, J., Krishnaswamy, P.: Multimodal multitask deep learning for X-ray image retrieval. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12905, pp. 603–613. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87240-3_58
Acknowledgement
This work was partially funded by the Project TAMI - Transparent Artificial Medical Intelligence (NORTE-01-0247-FEDER-045905) financed by ERDF - European Regional Fund through the North Portugal Regional Operational Program - NORTE 2020 and by the Portuguese Foundation for Science and Technology - FCT under the CMU - Portugal International Partnership, and also by the Portuguese Foundation for Science and Technology - FCT within PhD grant number SFRH/BD/139468/2018.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Mata, D., Silva, W., Cardoso, J.S. (2022). Increased Robustness in Chest X-Ray Classification Through Clinical Report-Driven Regularization. In: Pinho, A.J., Georgieva, P., Teixeira, L.F., Sánchez, J.A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2022. Lecture Notes in Computer Science, vol 13256. Springer, Cham. https://doi.org/10.1007/978-3-031-04881-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-04881-4_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04880-7
Online ISBN: 978-3-031-04881-4
eBook Packages: Computer ScienceComputer Science (R0)