Abstract
Respiratory diseases are the leading cause of many hospital admissions and account for a significant portion of fatalities each year in Europe. Long-term respiratory conditions such as asthma affect millions of people globally. A crucial element in diagnosing and monitoring respiratory disease is assessing lung sounds known as respiratory auscultation. Nevertheless, this process can be automated using Deep Learning (DL) techniques to alleviate the strain on healthcare services. This work offers a comparison of various State-Of-The-Art DL models’, namely ConvNeXt, and Vision and Swin transformers for predicting respiratory diseases asthma and COPD and healthy controls from a novel dataset of lung sound recordings represented by melspectrograms. The research concludes that using ConvNeXt in its’ Base configuration outperforms other networks with metrics including accuracy, sensitivity, precision, specificity and F1 score.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Åström, C., Orru, H., Rocklöv, J., Strandberg, G., Ebi, K.L., Forsberg, B.: Heat-related respiratory hospital admissions in Europe in a changing climate: a health impact assessment. BMJ Open 3(1), e001842 (2013)
Basu, V., Rana, S.: Respiratory diseases recognition through respiratory sound with the help of deep neural network. In: 2020 4th International Conference on Computational Intelligence and Networks (CINE), pp. 1–6 (2020)
Chambres, G., Hanna, P., Desainte-Catherine, M.: Automatic detection of patient with respiratory diseases using lung sound analysis. In: 2018 International Conference on Content-Based Multimedia Indexing (CBMI), pp. 1–6 (2018)
Chanane, H., Bahoura, M.: Convolutional neural network-based model for lung sounds classification. In: 2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS), pp. 555–558 (2021)
Chang, Y., Ren, Z., Schuller, B.W.: Transformer-based CNNs: mining temporal context information for multi-sound COVID-19 diagnosis. In: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 2335–2338 (2021)
Chollet, F.: Xception: deep learning with depthwise separable convolutiond, pp. 1251–1258 (2017). arXiv: arXiv:1610.02357
Dosovitskiy, A., et al.: An image is worth 16 \(\times \) 16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Fraiwan, M., Fraiwan, L., Alkhodari, M., Hassanin, O.: Recognition of pulmonary diseases from lung sounds using convolutional neural networks and long short-term memory. J. Ambient Intell. Humaniz. Comput. 1–13 (2022)
Fraiwan, M., Fraiwan, L., Khassawneh, B., Ibnian, A.: A dataset of lung sounds recorded from the chest wall using an electronic stethoscope. Data Brief 35, 106913 (2021)
Gairola, S., Tom, F., Kwatra, N., Jain, M.: Respirenet: a deep neural network for accurately detecting abnormal lung sounds in limited data setting. In: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 527–530 (2021)
Habashy, K., et al.: Cough classification using audio spectrogram transformer. In: 2022 IEEE Sensors Applications Symposium (SAS), pp. 1–6 ( 2022)
Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. CoRR abs/2103.14030 (2021), https://arxiv.org/abs/2103.14030
Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11966–11976 (2022)
Müller, V., et al.: Characteristics of reversible and nonreversible copd and asthma and copd overlap syndrome patients: an analysis of salbutamol easyhaler data. Int. J. Chronic Obstr. Pulm. Dis. 93–101 (2016)
Nguyen, T., Pernkopf, F.: Lung sound classification using snapshot ensemble of convolutional neural networks. In: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 760–763 (2020)
Nguyen, T., Pernkopf, F.: Lung sound classification using co-tuning and stochastic normalization. IEEE Trans. Biomed. Eng. 69(9), 2872–2882 (2022)
Perna, D., Tagarelli, A.: Deep auscultation: predicting respiratory anomalies and diseases via recurrent neural networks. In: 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS), pp. 50–55 (2019)
Pham, L., McLoughlin, I., Phan, H., Tran, M., Nguyen, T., Palaniappan, R.: Robust deep learning framework for predicting respiratory anomalies and diseases. In: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 164–167 (2020)
Pham, L., Phan, H., Palaniappan, R., Mertins, A., McLoughlin, I.: CNN-MoE based framework for classification of respiratory anomalies and lung disease detection. IEEE J. Biomed. Health Inform. 25(8), 2938–2947 (2021)
Pham, L., Phan, H., Schindler, A., King, R., Mertins, A., McLoughlin, I.: Inception-based network and multi-spectrogram ensemble applied to predict respiratory anomalies and lung diseases. In: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 253–256 (2021)
Rocha, B.M., et al.: An open access database for the evaluation of respiratory sound classification algorithms. Physiol. Meas. 40(3), 035001 (2019)
Shi, L., Zhang, J., Yang, B., Gao, Y.: Lung sound recognition method based on multi-resolution interleaved net and time-frequency feature enhancement. IEEE J. Biomed. Health Inform. 27(10), 4768–4779 (2023)
WHO: The top 10 causes of death (2020). https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death. Accessed 04 Mar 2024
Wu, B., et al.: Visual transformers: token-based image representation and processing for computer vision (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Burrows, H., Oghaz, M.M., Saheer, L.B. (2025). Respiratory Disease Detection Using Deep Convolutional Transformer Models. In: Bramer, M., Stahl, F. (eds) Artificial Intelligence XLI. SGAI 2024. Lecture Notes in Computer Science(), vol 15447. Springer, Cham. https://doi.org/10.1007/978-3-031-77918-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-77918-3_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-77917-6
Online ISBN: 978-3-031-77918-3
eBook Packages: Computer ScienceComputer Science (R0)