Abstract
The chest X-ray (CXR) is one of the most commonly available radiological examinations for identifying chest diseases. The application of deep learning methods in computer vision is becoming more and more mature, it provides new methods for automatic analysis of medical images and assisting doctors in high-precision intelligent diagnosis. In this paper, we propose a dual attention network to identify cardiomegaly (CXRDANet) on CXR images. CXRDANet is equipped with channel attention module (CAM) and spatial attention module (SAM), which selectively enhance features highly related to lesion area. We select CXR images of cardiomegaly and normal from ChestX-ray14 and NLM-CXR, without overlapping images, as the training set and the test set. Experimental results show that our method attains the accuracy of 0.9050, the sensitivity of 0.9445, the specificity of 0.8610, the F1 score of 0.9059, the AUC of 0.9588, which is a new state-of-the-art performance. In addition, we apply our method to the multi-label CXR image classification, and its performance has reached an excellent level.
Similar content being viewed by others
References
Zhou S, Zhang X, Zhang R (2019) Identifying cardiomegaly in chestx-ray8 using transfer learning. Stud Health Technol Inf 264:482–486
Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, et al. (2017) Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv:1711.05225
Abiyev RH, Ma’aitah MKS (2018) Deep convolutional neural networks for chest diseases detection. J Healthcare Eng 2018:1–11
Shen D, Wu G, Suk H-I (2017) Deep learning in medical image analysis. Ann Rev Biomed Eng 19:221–248
Ker J, Wang L, Rao J, Lim T (2017) Deep learning applications in medical image analysis. IEEE Access 6:9375–9389
Kadam VJ, Jadhav SM, Vijayakumar K (2019) Breast cancer diagnosis using feature ensemble learning based on stacked sparse autoencoders and softmax regression. J Med Syst 43(8):263
Rocha J, Cunha A, Mendonça AM (2020) Conventional filtering versus u-net based models for pulmonary nodule segmentation in ct images. J Med Syst 44(4):1–8
Doshi D, Shenoy A, Sidhpura D, Gharpure P (2016) Diabetic retinopathy detection using deep convolutional neural networks. In: 2016 International Conference on Computing, Analytics and Security Trends (CAST). IEEE, pp 261–266
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Huang G, Liu Zx, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Wang X, Peng Y, Lu L, Lu Zx, Bagheri M, Summers RM (2017) Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2097–2106
Irvin J, Rajpurkar P, Ko M, Yu Y, Ciurea-Ilcus S, Chute C, Marklund H, Haghgoo B, Ball R, Shpanskaya K et al (2019) Chexpert: A large chest radiograph dataset with uncertainty labels and expert comparison. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 33, pp 590–597
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Candemir S, Rajaraman S, Thoma G, Antani S (2018) Deep learning for grading cardiomegaly severity in chest x-rays: an investigation. In: 2018 IEEE Life Sciences Conference (LSC). IEEE, pp 109–113
Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1492–1500
Howard A, Sandler M, Chu G, Chen L-C, Chen B, Tan M, Wang W, Zhu Y, Pang R, Vasudevan V et al (2019) Searching for mobilenetv3. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 1314–1324
Li X, Hu X, Yang J (2019) Spatial group-wise enhance: Improving semantic feature learning in convolutional networks. arXiv:1905.09646
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q (2020) Eca-net: Efficient channel attention for deep convolutional neural networks, 2020 IEEE. In: CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Li X, Wang W, Hu X, Yang J (2019) Selective kernel networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 510–519
Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3146–3154
Tang Y, Wang X, Harrison AP, Lu L, Xiao J, Summers RM (2018) Attention-guided curriculum learning for weakly supervised classification and localization of thoracic diseases on chest radiographs. In: International Workshop on Machine Learning in Medical Imaging. Springer, pp 249–258
Guan Q, Huang Y, Zhong Z, Zheng Z, Zheng L, Yang Y (2020) Thorax disease classification with attention guided convolutional neural network. Pattern Recogn Lett 131:38–45
Yao L, Poblenz E, Dagunts D, Covington B, Bernard D, Lyman K (2017) Learning to diagnose from scratch by exploiting dependencies among labels. arXiv:1710.10501
Ma Y, Zhou Q, Chen X, Lu H, Zhao Y (2019) Multi-attention network for thoracic disease classification and localization. In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1378– 1382
Hou Q, Zhang L, Cheng M-M, Feng J (2020) Strip pooling: Rethinking spatial pooling for scene parsing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4003–4012
Zhang X, Zhou X, Lin M, Sun J (2018) Shufflenet: An extremely efficient convolutional neural network for mobile devices. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6848–6856
Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Goyal P, Dollár P, Girshick R, Noordhuis P, Wesolowski L, Kyrola A, Tulloch A, Jia Y, He K (2017) Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv:1706.02677
He T, Zhang Z, Zhang H, Zhang Z, Xie J, Li M (2019) Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 558–567
Agnes SA, Anitha J, Pandian SIA, Peter JD (2020) Classification of mammogram images using multiscale all convolutional neural network (ma-cnn). J Med Syst 44(1):30
Guendel S, Grbic S, Georgescu B, Liu S, Maier A, Comaniciu D (2018) Learning to recognize abnormalities in chest x-rays with location-aware dense networks. In: Iberoamerican Congress on Pattern Recognition. Springer, pp 757–765
Guan Q, Huang Y (2020) Multi-label chest x-ray image classification via category-wise residual attention learning. Pattern Recogn Lett 130:259–266
Frishman WH, Nadelmann J, Ooi WL, Greenberg S, Heiman M, Kahn S, Guzik H, Lazar EJ, Aronson Miriam (1992) Cardiomegaly on chest x-ray: prognostic implications from a ten-year cohort study of elderly subjects: a report from the bronx longitudinal aging study. Amer Heart J 124(4):1026–1030
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
In Case Animals Were Involved
Ethical Approval Animals were not involved.
And/or in Case Humans Were Involved
Ethical Approval This article does not contain any studies with human participants performed by any of the authors.
Competing of Interests
The authors have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, L., Mao, T. & Zhang, Q. Identifying cardiomegaly in chest x-rays using dual attention network. Appl Intell 52, 11058–11067 (2022). https://doi.org/10.1007/s10489-021-02935-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02935-w