ABSTRACT
The speech signal has numerous features that represent the characteristics of a specific language and recognize emotions. It also contains information that can be used to identify the mental, psychological, and physical states of the speaker. Recently, the acoustic analysis of speech signals offers a practical, automated, and scalable method for medical diagnosis and monitoring symptoms of many diseases. In this paper, we explore the deep acoustic features from confirmed positive and negative cases of COVID-19 and compare the performance of the acoustic features and COVID-19 symptoms in terms of their ability to diagnose COVID-19. The proposed methodology consists of the pre-trained Visual Geometry Group (VGG-16) model based on Mel spectrogram images to extract deep audio features. In addition to the K-means algorithm that determines effective features, followed by a Genetic Algorithm-Support Vector Machine (GA-SVM) classifier to classify cases. The experimental findings indicate the proposed methodology’s capability to classify COVID-19 and NOT COVID-19 from acoustic features compared to COVID-19 symptoms, achieving an accuracy of 97%. The experimental results show that the proposed method remarkably improves the accuracy of COVID-19 detection over the handcrafted features used in previous studies.
- Musaed Alhussein and Ghulam Muhammad. 2018. Voice pathology detection using deep learning on mobile healthcare framework. IEEE Access 6(2018), 41034–41041.Google ScholarCross Ref
- Lmar M Babrak, Joseph Menetski, Michael Rebhan, Giovanni Nisato, Marc Zinggeler, Noé Brasier, Katja Baerenfaller, Thomas Brenzikofer, Laurenz Baltzer, Christian Vogler, 2019. Traditional and digital biomarkers: two worlds apart?Digital biomarkers 3, 2 (2019), 92–102.Google Scholar
- Andrea Coravos, Sean Khozin, and Kenneth D Mandl. 2019. Developing and adopting safe and effective digital biomarkers to improve patient outcomes. NPJ digital medicine 2, 1 (2019), 1–5.Google Scholar
- Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine learning 20, 3 (1995), 273–297.Google ScholarDigital Library
- Domenico Cucinotta and Maurizio Vanelli. 2020. WHO declares COVID-19 a pandemic. Acta Bio Medica: Atenei Parmensis 91, 1 (2020), 157.Google Scholar
- Nicholas Cummins, Alice Baird, and Bjoern W Schuller. 2018. Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning. Methods 151(2018), 41–54.Google ScholarCross Ref
- Guy Fagherazzi, Aurélie Fischer, Muhannad Ismael, and Vladimir Despotovic. 2021. Voice for health: the use of vocal biomarkers from research to clinical practice. Digital biomarkers 5, 1 (2021), 78–88.Google Scholar
- María Teresa García-Ordás, José Alberto Benítez-Andrades, Isaías García-Rodríguez, Carmen Benavides, and Héctor Alaiz-Moretón. 2020. Detecting respiratory pathologies using convolutional neural networks and variational autoencoders for unbalancing data. Sensors 20, 4 (2020), 1214.Google ScholarCross Ref
- Aurélien Géron. 2019. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. " O’Reilly Media, Inc.", Sebastopol, CA, USA.Google Scholar
- Jing Han, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, and Cecilia Mascolo. 2021. Exploring automatic COVID-19 diagnosis via voice and symptoms from crowdsourced data. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 8328–8332.Google ScholarCross Ref
- Sandeep Kaur and Sheetal Kalra. 2016. Disease prediction using hybrid K-means and support vector machine. In 2016 1st India international conference on information processing (IICIP). IEEE, 1–6.Google ScholarCross Ref
- Murtaza Ali Khan. 2021. An automated and fast system to identify COVID-19 from X-ray radiograph of the chest using image processing and machine learning. International journal of imaging systems and technology 31, 2(2021), 499–508.Google Scholar
- Siddique Latif, Junaid Qadir, Adnan Qayyum, Muhammad Usama, and Shahzad Younis. 2020. Speech technology for healthcare: Opportunities, challenges, and state of the art. IEEE Reviews in Biomedical Engineering 14 (2020), 342–356.Google ScholarCross Ref
- Navin Kumar Manaswi. 2018. Deep learning with applications using python: Chatbots and Face, Object, and Speech Recognition With TensorFlow and Keras. Apress Berkeley, CA.Google Scholar
- Brian McFee, Colin Raffel, Dawen Liang, Daniel P Ellis, Matt McVicar, Eric Battenberg, and Oriol Nieto. 2015. librosa: Audio and music signal analysis in python. In Proceedings of the 14th python in science conference, Vol. 8. Citeseer, 18–25.Google ScholarCross Ref
- Nick Pentreath. 2015. Machine learning with spark. Packt Publishing, Birmingham.Google Scholar
- Kun Qian, Xiao Li, Haifeng Li, Shengchen Li, Wei Li, Zuoliang Ning, Shuai Yu, Limin Hou, Gang Tang, Jing Lu, 2020. Computer audition for healthcare: Opportunities and challenges. Frontiers in Digital Health 2 (2020), 5.Google ScholarCross Ref
- Lawrence R Rabiner, Ronald W Schafer, 2007. Introduction to digital speech processing. Foundations and Trends® in Signal Processing 1, 1–2(2007), 1–194.Google Scholar
- Saloni, Rajender K Sharma, and Anil K Gupta. 2014. Disease detection using voice analysis: A review. International Journal of Medical Engineering and Informatics 6, 3(2014), 189–209.Google ScholarDigital Library
- Soumya Sen, Anjan Dutta, and Nilanjan Dey. 2019. Audio processing and speech recognition: concepts, techniques and research overviews. Springer, Berlin.Google Scholar
- Maksut Senbekov, Timur Saliev, Zhanar Bukeyeva, Aigul Almabayeva, Marina Zhanaliyeva, Nazym Aitenova, Yerzhan Toishibekov, and Ildar Fakhradiyev. 2020. The recent progress and applications of digital technologies in healthcare: a review. International journal of telemedicine and applications 2020 (2020).Google ScholarDigital Library
- Garima Sharma, Kartikeyan Umapathy, and Sridhar Krishnan. 2020. Trends in audio signal feature extraction methods. Applied Acoustics 158(2020), 107020.Google ScholarCross Ref
- Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In 3rd International Conference on Learning Representations,ICLR. 1–4.Google Scholar
- Brian Stasak, Zhaocheng Huang, Sabah Razavi, Dale Joachim, and Julien Epps. 2021. Automatic detection of COVID-19 based on short-duration acoustic smartphone speech analysis. Journal of Healthcare Informatics Research 5, 2 (2021), 201–217.Google ScholarCross Ref
- Mohammed Usman, Vinit Kumar Gunjan, Mohd Wajid, Mohammed Zubair, 2022. Speech as a Biomarker for COVID-19 Detection Using Machine Learning. Computational Intelligence and Neuroscience 2022 (2022).Google Scholar
- Laura Verde, Giuseppe De Pietro, and Giovanna Sannino. 2021. Artificial Intelligence Techniques for the Non-invasive Detection of COVID-19 Through the Analysis of Voice Signals. Arabian Journal for Science and Engineering(2021), 1–11.Google Scholar
- Pauline Vetter, Diem Lan Vu, Arnaud G L’Huillier, Manuel Schibler, Laurent Kaiser, and Frederique Jacquerioz. 2020. Clinical features of covid-19.Google Scholar
- Eyal Wirsansky. 2020. Hands-on genetic algorithms with Python: applying genetic algorithms to solve real-world deep learning and artificial intelligence problems. Packt Publishing Ltd, Birmingham, U.K.Google Scholar
- Tong Xia, Dimitris Spathis, J Ch, Andreas Grammenos, Jing Han, Apinan Hasthanasombat, Erika Bondareva, Ting Dang, Andres Floto, Pietro Cicuta, 2021. COVID-19 Sounds: A Large-Scale Audio Dataset for Digital Respiratory Screening. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2).Google Scholar
- Laiba Zahid, Muazzam Maqsood, Mehr Yahya Durrani, Maheen Bakhtyar, Junaid Baber, Habibullah Jamal, Irfan Mehmood, and Oh-Young Song. 2020. A spectrogram-based deep feature assisted computer-aided diagnostic system for Parkinson’s disease. IEEE Access 8(2020), 35482–35495.Google ScholarCross Ref
- Bichen Zheng, Sang Won Yoon, and Sarah S Lam. 2014. Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Systems with Applications 41, 4 (2014), 1476–1482.Google ScholarDigital Library
Index Terms
- Artificial Intelligence-based Speech Signal for COVID-19 Diagnostics
Recommendations
Automated speech signal analysis based on feature extraction and classification of spasmodic dysphonia: a performance comparison of different classifiers
Spasmodic Dysphonia is a voice disorder caused due to spasm of involuntary muscles in the voice box. These spasms can leads to breathy, soundless voice breaks, strangled voice by interrupting the opening of the vocal folds. There is no specific test for ...
Speech emotion recognition based on optimized deep features of dual-channel complementary spectrogram
AbstractSpeech emotion recognition (SER) is an essential field of artificial intelligence. Although the Mel spectrogram is commonly used in SER, it emphasizes low-frequency emotional components. In this paper, we propose VMD-Teager-Mel (VTMel) ...
Highlights- A VTMel spectrogram that supplements the Mel spectrogram is proposed, highlighting high-frequency components.
Speech Signal Feature Extraction Based on Wavelet Transform
ICBMI '11: Proceedings of the 2011 International Conference on Intelligent Computation and Bio-Medical InstrumentationAnalysis of the voice pronunciation mechanism and performance differences of normal voice in the frequency domain, wavelet transform is used to do signal decomposition, and emphasizing characteristics of voice, with these two characteristic parameters ...
Comments