Skip to main content
Log in

Identifying cardiomegaly in chest x-rays using dual attention network

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The chest X-ray (CXR) is one of the most commonly available radiological examinations for identifying chest diseases. The application of deep learning methods in computer vision is becoming more and more mature, it provides new methods for automatic analysis of medical images and assisting doctors in high-precision intelligent diagnosis. In this paper, we propose a dual attention network to identify cardiomegaly (CXRDANet) on CXR images. CXRDANet is equipped with channel attention module (CAM) and spatial attention module (SAM), which selectively enhance features highly related to lesion area. We select CXR images of cardiomegaly and normal from ChestX-ray14 and NLM-CXR, without overlapping images, as the training set and the test set. Experimental results show that our method attains the accuracy of 0.9050, the sensitivity of 0.9445, the specificity of 0.8610, the F1 score of 0.9059, the AUC of 0.9588, which is a new state-of-the-art performance. In addition, we apply our method to the multi-label CXR image classification, and its performance has reached an excellent level.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Zhou S, Zhang X, Zhang R (2019) Identifying cardiomegaly in chestx-ray8 using transfer learning. Stud Health Technol Inf 264:482–486

    Google Scholar 

  2. Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, et al. (2017) Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv:1711.05225

  3. Abiyev RH, Ma’aitah MKS (2018) Deep convolutional neural networks for chest diseases detection. J Healthcare Eng 2018:1–11

    Article  Google Scholar 

  4. Shen D, Wu G, Suk H-I (2017) Deep learning in medical image analysis. Ann Rev Biomed Eng 19:221–248

    Article  Google Scholar 

  5. Ker J, Wang L, Rao J, Lim T (2017) Deep learning applications in medical image analysis. IEEE Access 6:9375–9389

    Article  Google Scholar 

  6. Kadam VJ, Jadhav SM, Vijayakumar K (2019) Breast cancer diagnosis using feature ensemble learning based on stacked sparse autoencoders and softmax regression. J Med Syst 43(8):263

    Article  Google Scholar 

  7. Rocha J, Cunha A, Mendonça AM (2020) Conventional filtering versus u-net based models for pulmonary nodule segmentation in ct images. J Med Syst 44(4):1–8

    Article  Google Scholar 

  8. Doshi D, Shenoy A, Sidhpura D, Gharpure P (2016) Diabetic retinopathy detection using deep convolutional neural networks. In: 2016 International Conference on Computing, Analytics and Security Trends (CAST). IEEE, pp 261–266

  9. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  10. Huang G, Liu Zx, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708

  11. Wang X, Peng Y, Lu L, Lu Zx, Bagheri M, Summers RM (2017) Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2097–2106

  12. Irvin J, Rajpurkar P, Ko M, Yu Y, Ciurea-Ilcus S, Chute C, Marklund H, Haghgoo B, Ball R, Shpanskaya K et al (2019) Chexpert: A large chest radiograph dataset with uncertainty labels and expert comparison. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 33, pp 590–597

  13. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9

  14. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105

    Google Scholar 

  15. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556

  16. Candemir S, Rajaraman S, Thoma G, Antani S (2018) Deep learning for grading cardiomegaly severity in chest x-rays: an investigation. In: 2018 IEEE Life Sciences Conference (LSC). IEEE, pp 109–113

  17. Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1492–1500

  18. Howard A, Sandler M, Chu G, Chen L-C, Chen B, Tan M, Wang W, Zhu Y, Pang R, Vasudevan V et al (2019) Searching for mobilenetv3. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 1314–1324

  19. Li X, Hu X, Yang J (2019) Spatial group-wise enhance: Improving semantic feature learning in convolutional networks. arXiv:1905.09646

  20. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  21. Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q (2020) Eca-net: Efficient channel attention for deep convolutional neural networks, 2020 IEEE. In: CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE

  22. Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803

  23. Li X, Wang W, Hu X, Yang J (2019) Selective kernel networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 510–519

  24. Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3146–3154

  25. Tang Y, Wang X, Harrison AP, Lu L, Xiao J, Summers RM (2018) Attention-guided curriculum learning for weakly supervised classification and localization of thoracic diseases on chest radiographs. In: International Workshop on Machine Learning in Medical Imaging. Springer, pp 249–258

  26. Guan Q, Huang Y, Zhong Z, Zheng Z, Zheng L, Yang Y (2020) Thorax disease classification with attention guided convolutional neural network. Pattern Recogn Lett 131:38–45

    Article  Google Scholar 

  27. Yao L, Poblenz E, Dagunts D, Covington B, Bernard D, Lyman K (2017) Learning to diagnose from scratch by exploiting dependencies among labels. arXiv:1710.10501

  28. Ma Y, Zhou Q, Chen X, Lu H, Zhao Y (2019) Multi-attention network for thoracic disease classification and localization. In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1378– 1382

  29. Hou Q, Zhang L, Cheng M-M, Feng J (2020) Strip pooling: Rethinking spatial pooling for scene parsing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4003–4012

  30. Zhang X, Zhou X, Lin M, Sun J (2018) Shufflenet: An extremely efficient convolutional neural network for mobile devices. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6848–6856

  31. Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988

  32. Goyal P, Dollár P, Girshick R, Noordhuis P, Wesolowski L, Kyrola A, Tulloch A, Jia Y, He K (2017) Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv:1706.02677

  33. He T, Zhang Z, Zhang H, Zhang Z, Xie J, Li M (2019) Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 558–567

  34. Agnes SA, Anitha J, Pandian SIA, Peter JD (2020) Classification of mammogram images using multiscale all convolutional neural network (ma-cnn). J Med Syst 44(1):30

    Article  Google Scholar 

  35. Guendel S, Grbic S, Georgescu B, Liu S, Maier A, Comaniciu D (2018) Learning to recognize abnormalities in chest x-rays with location-aware dense networks. In: Iberoamerican Congress on Pattern Recognition. Springer, pp 757–765

  36. Guan Q, Huang Y (2020) Multi-label chest x-ray image classification via category-wise residual attention learning. Pattern Recogn Lett 130:259–266

    Article  Google Scholar 

  37. Frishman WH, Nadelmann J, Ooi WL, Greenberg S, Heiman M, Kahn S, Guzik H, Lazar EJ, Aronson Miriam (1992) Cardiomegaly on chest x-ray: prognostic implications from a ten-year cohort study of elderly subjects: a report from the bronx longitudinal aging study. Amer Heart J 124(4):1026–1030

    Article  Google Scholar 

  38. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lifang Chen.

Ethics declarations

Ethical Approval

This article does not contain any studies with human participants or animals performed by any of the authors.

In Case Animals Were Involved

Ethical Approval Animals were not involved.

And/or in Case Humans Were Involved

Ethical Approval This article does not contain any studies with human participants performed by any of the authors.

Competing of Interests

The authors have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, L., Mao, T. & Zhang, Q. Identifying cardiomegaly in chest x-rays using dual attention network. Appl Intell 52, 11058–11067 (2022). https://doi.org/10.1007/s10489-021-02935-w

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-02935-w

Keywords

Navigation