skip to main content
10.1145/3670085.3670088acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmaiConference Proceedingsconference-collections
research-article

COVIDTran: an automated COVID-19 diagnosis system via Context Transfer Transformer

Published: 22 August 2024 Publication History

Abstract

In this paper we present COVIDTran, an automated COVID diagnostic system that takes symptomatic cough audios as input and identifies potential cases of COVID19. Adopting principles from Transfer Learning, we implement neural network based on Vision Transformer that processes the spectrographic maps of the cough audio signals, and promote the robustness of our model by integrating contextual information from similar flu symptomatic datasets via transfer learning. Experimental results involving crowdsourced COVID coughing and speech datasets suggest that our strategy outperforms other current methods as measured by different metrics, thereby providing new insights on automated COVID19 diagnosis on top of existing methods.

References

[1]
Bless Lord Y Agbley, Jianping Li, Aminul Haq, Bernard Cobbinah, Delanyo Kulevome, Priscilla A Agbefu, and Bright Eleeza. 2020. Wavelet-based cough signal decomposition for multimodal classification. In 2020 17th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP). IEEE, 5–9.
[2]
Tauseef Ahmad, Mukhtiar Baig Haroon, and Jin Hui. 2020. Coronavirus disease 2019 (COVID-19) pandemic and economic impact. Pakistan journal of medical sciences 36, COVID19-S4 (2020), S73.
[3]
Md. Manjurul Ahsan and Zahed Siddique. 2021. Machine learning based disease diagnosis: A comprehensive review. CoRR abs/2112.15538 (2021). arXiv:2112.15538https://arxiv.org/abs/2112.15538
[4]
Saira Baloch, Mohsin Ali Baloch, Tianli Zheng, and Xiaofang Pei. 2020. The coronavirus disease 2019 (COVID-19) pandemic. The Tohoku journal of experimental medicine 250, 4 (2020), 271–278.
[5]
Nick J Beeching, Tom E Fletcher, and Mike BJ Beadsworth. 2020. Covid-19: testing times.
[6]
Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, and Siegfried Kunzmann. 2021. Context-Aware Transformer Transducer for Speech Recognition. CoRR abs/2111.03250 (2021). arXiv:2111.03250https://arxiv.org/abs/2111.03250
[7]
Panda G Satapathy SC Dash TK, Mishra S. 2021. Detection of COVID-19 from speech signal using bio-inspired based cepstral features. Pattern recognition (09 2021), 117. https://doi.org/
[8]
Vamsidhar Enireddy, Mathe John Kenny Kumar, Babitha Donepudi, and C Karthikeyan. 2020. Detection of COVID-19 using Hybrid ResNet and SVM. IOP Conference Series: Materials Science and Engineering 993, 1 (dec 2020), 012046. https://doi.org/10.1088/1757-899x/993/1/012046
[9]
Xiaohong W. Gao, Rui Hui, and Zengmin Tian. 2017. Classification of CT brain images based on deep learning networks. Computer Methods and Programs in Biomedicine 138 (2017), 49–56. https://doi.org/10.1016/j.cmpb.2016.10.007
[10]
Xu Gaozheng, Fang Zixuan, and Wang Lifang. 2020. Research on SVM Fault Diagnosis Method Based on Text Feature Extraction Algorithm. In 2020 International Conference on Sensing, Diagnostics, Prognostics, and Control (SDPC). 199–203. https://doi.org/10.1109/SDPC49476.2020.9353155
[11]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-term Memory. Neural computation 9 (12 1997), 1735–80. https://doi.org/10.1162/neco.1997.9.8.1735
[12]
Md Zabirul Islam, Md Milon Islam, and Amanullah Asraf. 2020. A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images. Informatics in medicine unlocked 20 (2020), 100412.
[13]
Matthew M Kavanagh, Ngozi A Erondu, Oyewale Tomori, Victor J Dzau, Emelda A Okiro, Allan Maleche, Ifeyinwa C Aniebo, Umunya Rugege, Charles B Holmes, and Lawrence O Gostin. 2020. Access to lifesaving medical resources for African countries: COVID-19 testing and response, ethics, and politics. The Lancet 395, 10238 (2020), 1735–1738.
[14]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems, F. Pereira, C. J. C. Burges, L. Bottou, and K. Q. Weinberger (Eds.). Vol. 25. Curran Associates, Inc.
[15]
Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2020. Pretrained Transformers for Text Ranking: BERT and Beyond. https://doi.org/10.48550/ARXIV.2010.06467
[16]
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 10012–10022.
[17]
Alexander Selvikvåg Lundervold and Arvid Lundervold. 2019. An overview of deep learning in medical imaging focusing on MRI. Zeitschrift für Medizinische Physik 29, 2 (2019), 102–127.
[18]
Elisabeth Mahase. 2020. Covid-19: Mass testing is inaccurate and gives false sense of security, minister admits.
[19]
Masaki Makino, Ryo Yoshimoto, Masaki Ono, Toshinari Itoko, Takayuki Katsuki, Akira Koseki, Michiharu Kudo, Kyoichi Haida, Jun Kuroda, Ryosuke Yanagiya, 2019. Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Scientific reports 9, 1 (2019), 1–9.
[20]
Lara Orlandic, Tomas Teijeiro, and David Atienza. 2021. The COUGHVID crowdsourcing dataset, a corpus for the study of large-scale cough analysis algorithms. Scientific Data 8, 1 (2021), 1–10.
[21]
Karol J. Piczak. 2015. ESC: Dataset for Environmental Sound Classification. In Proceedings of the 23rd ACM International Conference on Multimedia (Brisbane, Australia) (MM ’15). Association for Computing Machinery, New York, NY, USA, 1015–1018. https://doi.org/10.1145/2733373.2806390
[22]
Matteo Polsinelli, Luigi Cinque, and Giuseppe Placidi. 2020. A light CNN for detecting COVID-19 from CT scans of the chest. Pattern recognition letters 140 (2020), 95–100.
[23]
Ramesh Chandra Poonia, Mukesh Kumar Gupta, Ibrahim Abunadi, Amani Abdulrahman Albraikan, Fahd N. Al-Wesabi, Manar Ahmed Hamza, and Tulasi B. 2022. Intelligent Diagnostic Prediction and Classification Models for Detection of Kidney Disease. Healthcare 10, 2 (2022). https://www.mdpi.com/2227-9032/10/2/371
[24]
Hasib-Al Rashid, Arnab Neelim Mazumder, Utteja Panchakshara Kallakuri Niyogi, and Tinoosh Mohsenin. 2021. CoughNet: A Flexible Low Power CNN-LSTM Processor for Cough Sound Detection. In 2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS). 1–4. https://doi.org/10.1109/AICAS51828.2021.9458509
[25]
José C. Segura, M.C. Benítez, Ángel de la Torre, and Antonio J. Rubio. 2002. Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR. In Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002). 225–228.
[26]
Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Prasanta Kumar Ghosh, Sriram Ganapathy, 2020. Coswara–a database of breathing, cough, and voice sounds for COVID-19 diagnosis. arXiv preprint arXiv:2005.10548 (2020).
[27]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. https://doi.org/10.48550/ARXIV.1409.1556
[28]
Myoung-Jin Son and Seok-Pil Lee. 2022. COVID-19 Diagnosis from Crowdsourced Cough Sound Data. Applied Sciences 12, 4 (2022). https://doi.org/10.3390/app12041795
[29]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. CoRR abs/1706.03762 (2017). arXiv:1706.03762http://arxiv.org/abs/1706.03762
[30]
Chengyi Wang, Yu Wu, Yao Qian, Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, and Xuedong Huang. 2021. UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data. (01 2021).
[31]
Zhou G Xu L Liu Y. Wang D, Mo J. 2020. Research on SVM Fault Diagnosis Method Based on Text Feature Extraction Algorithm. In PLoS One. https://doi.org/
[32]
Fuzhen Zhuang, Zhiyuan Qi, Keyu Duan, Dongbo Xi, Yongchun Zhu, Hengshu Zhu, Hui Xiong, and Qing He. 2019. A Comprehensive Survey on Transfer Learning. CoRR abs/1911.02685 (2019). arXiv:1911.02685http://arxiv.org/abs/1911.02685
[33]
Erdi Çallı, Ecem Sogancioglu, Bram van Ginneken, Kicky G. van Leeuwen, and Keelin Murphy. 2021. Deep learning for chest X-ray analysis: A survey. Medical Image Analysis 72 (2021), 102125. https://doi.org/10.1016/j.media.2021.102125

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMAI '24: Proceedings of the 2024 9th International Conference on Mathematics and Artificial Intelligence
May 2024
134 pages
ISBN:9798400717284
DOI:10.1145/3670085
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 August 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Audio and Speech Classification
  2. Deep Learning
  3. Transfer Learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICMAI 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 23
    Total Downloads
  • Downloads (Last 12 months)23
  • Downloads (Last 6 weeks)9
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media