Skip to main content

Increased Robustness in Chest X-Ray Classification Through Clinical Report-Driven Regularization

  • Conference paper
  • First Online:
Pattern Recognition and Image Analysis (IbPRIA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13256))

Included in the following conference series:

Abstract

In highly regulated areas such as healthcare there is a demand for explainable and trustworthy systems that are capable of providing some sort of foundation or logical reasoning to their functionality. Therefore, deep learning applications associated with such industry are increasingly required by this sense of accountability regarding their production value. Additionally, it is of utter importance to take advantage of all possible data resources, in order to achieve a greater amount of efficiency respecting such intelligent frameworks, while maintaining a realistic medical scenario. As a way to explore this issue, we propose two models trained with information retained in chest radiographs and regularized by the associated medical reports. We argue that the knowledge extracted from the free-radiology text, in a multimodal training context, promotes more coherence, leading to better decisions and interpretability saliency maps. Our proposed approach demonstrated to be more robust than their baseline counterparts, showing better classification performances, and also ensuring more concise, consistent and less dispersed saliency maps. Our proof-of-concept experiments were done using the publicly available multimodal radiology dataset MIMIC-CXR that contains a myriad of chest X-rays and its correspondent free-text reports.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Baltrušaitis, T., Ahuja, C., Morency, L.P.: Multimodal machine learning: a survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41(2), 423–443 (2018)

    Article  Google Scholar 

  2. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2019)

    Google Scholar 

  3. Goldberger, A., et al.: Components of a new research resource for complex physiologic signals. PhysioNet 101 (2000)

    Google Scholar 

  4. Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks (2018)

    Google Scholar 

  5. Johnson, A.E.W., et al.: MIMIC-CXR-JPG: a large publicly available database of labeled chest radiographs. CoRR abs/1901.07042 (2019). http://arxiv.org/abs/1901.07042

  6. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  7. Kokhlikyan, N., et al.: Captum: A unified and generic model interpretability library for PyTorch (2020)

    Google Scholar 

  8. Li, Y., Tian, S., Huang, Y., Dong, W.: Driverless artificial intelligence framework for the identification of malignant pleural effusion. Transl. Oncol. 14(1), 100896 (2021). https://doi.org/10.1016/j.tranon.2020.100896. https://www.sciencedirect.com/science/article/pii/S1936523320303880

  9. Lucieri, A., Dengel, A., Ahmed, S.: Deep learning based decision support for medicine - a case study on skin cancer diagnosis (2021)

    Google Scholar 

  10. Shrikumar, A., Greenside, P., Kundaje, A.: Learning important features through propagating activation differences. In: International Conference on Machine Learning, pp. 3145–3153. PMLR (2017)

    Google Scholar 

  11. Silva, W., Poellinger, A., Cardoso, J.S., Reyes, M.: Interpretability-guided content-based medical image retrieval. In: Martel, A.L., et al. (eds.) MICCAI 2020. LNCS, vol. 12261, pp. 305–314. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59710-8_30

  12. Tjoa, E., Guan, C.: A survey on explainable artificial intelligence (XAI): toward medical XAI. IEEE Trans. Neural Netw. Learn. Syst. 32(11), 4793–4813 (2021)

    Google Scholar 

  13. Yu, Y., Hu, P., Lin, J., Krishnaswamy, P.: Multimodal multitask deep learning for X-ray image retrieval. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12905, pp. 603–613. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87240-3_58

Download references

Acknowledgement

This work was partially funded by the Project TAMI - Transparent Artificial Medical Intelligence (NORTE-01-0247-FEDER-045905) financed by ERDF - European Regional Fund through the North Portugal Regional Operational Program - NORTE 2020 and by the Portuguese Foundation for Science and Technology - FCT under the CMU - Portugal International Partnership, and also by the Portuguese Foundation for Science and Technology - FCT within PhD grant number SFRH/BD/139468/2018.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Diogo Mata .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mata, D., Silva, W., Cardoso, J.S. (2022). Increased Robustness in Chest X-Ray Classification Through Clinical Report-Driven Regularization. In: Pinho, A.J., Georgieva, P., Teixeira, L.F., Sánchez, J.A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2022. Lecture Notes in Computer Science, vol 13256. Springer, Cham. https://doi.org/10.1007/978-3-031-04881-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-04881-4_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-04880-7

  • Online ISBN: 978-3-031-04881-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics