Artificial Intelligence-based Speech Signal for COVID-19 Diagnostics

Authors:

Aseel Alfaidi
Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Saudi Arabia
ORCID: 0000-0002-1855-498X

Abdullah Alshahrani
Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Saudi Arabia
ORCID: 0000-0002-5988-888X

Maha Aljohani
Department of Software Engineering, College of Computer Science and Engineering, University of Jeddah, Saudi Arabia
ORCID: 0000-0002-1097-3311
ICFNDS '22: Proceedings of the 6th International Conference on Future Networks & Distributed Systems, December 2022, Pages 311–317. https://doi.org/10.1145/3584202.3584247

Published: 09 May 2023

ABSTRACT

Speech signals carry numerous features that characterize a specific language and convey the speaker's emotions. They also contain information that can be used to identify the mental, psychological, and physical states of the speaker. Recently, acoustic analysis of speech signals has offered a practical, automated, and scalable method for medical diagnosis and for monitoring the symptoms of many diseases. In this paper, we explore deep acoustic features extracted from confirmed positive and negative cases of COVID-19 and compare the performance of the acoustic features with that of COVID-19 symptoms in terms of their ability to diagnose COVID-19. The proposed methodology uses a pre-trained Visual Geometry Group (VGG-16) model applied to Mel spectrogram images to extract deep audio features, the K-means algorithm to determine the effective features, and a Genetic Algorithm-Support Vector Machine (GA-SVM) classifier to classify cases. The experimental findings indicate that the proposed methodology can classify cases as COVID-19 or NOT COVID-19 from acoustic features, compared with COVID-19 symptoms, achieving an accuracy of 97%. The results also show that the proposed method markedly improves the accuracy of COVID-19 detection over the handcrafted features used in previous studies.
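To make the first stage of the pipeline concrete, the following is a minimal sketch of how a Mel-scale representation can be computed from one frame of a power spectrum. It is an illustration only, not the authors' implementation: the abstract does not state the Mel formula, filter count, FFT size, or sample rate, so the HTK-style conversion and the names `hz_to_mel`, `mel_to_hz`, `mel_filterbank`, and `mel_spectrum` are assumptions chosen for this example.

```python
import math
from typing import List


def hz_to_mel(freq_hz: float) -> float:
    """Convert Hz to mels (HTK-style formula; an assumed choice)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels: int, n_fft: int, sample_rate: int) -> List[List[float]]:
    """Build n_mels triangular filters over the n_fft // 2 + 1 FFT bins."""
    n_bins = n_fft // 2 + 1
    # n_mels + 2 edge points, equally spaced in mels from 0 Hz to Nyquist.
    top_mel = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(top_mel * i / (n_mels + 1)) for i in range(n_mels + 2)]
    # Center frequency of every FFT bin, in Hz.
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]

    bank = []
    for m in range(1, n_mels + 1):
        left, center, right = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            rising = (f - left) / (center - left)
            falling = (right - f) / (right - center)
            # Triangle: rises from left to center, falls to right, 0 outside.
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank


def mel_spectrum(power_spectrum: List[float], bank: List[List[float]]) -> List[float]:
    """Project one frame's power spectrum onto the mel filters."""
    return [sum(w * p for w, p in zip(row, power_spectrum)) for row in bank]


if __name__ == "__main__":
    bank = mel_filterbank(n_mels=8, n_fft=512, sample_rate=16000)
    flat_frame = [1.0] * (512 // 2 + 1)  # synthetic frame, not real speech
    print(mel_spectrum(flat_frame, bank))
```

Stacking such mel spectra over successive frames, usually on a log scale, gives the Mel spectrogram that is rendered as an image for a network such as VGG-16. In practice a library routine such as `librosa.feature.melspectrogram` would normally replace this hand-written version.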

Index Terms

Artificial Intelligence-based Speech Signal for COVID-19 Diagnostics
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Kernel methods
        Support vector machines
      2. Neural networks

Published in

ICFNDS '22: Proceedings of the 6th International Conference on Future Networks & Distributed Systems
December 2022
734 pages
ISBN: 9781450399050
DOI: 10.1145/3584202

Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
COVID-19 diagnosis
Deep learning
Machine learning
Mel spectrogram
Speech signal
Qualifiers
- research-article
- Research
- Refereed limited