Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach

Upadhya, Savitha S.; Cheeran, A. N.; Nirmal, J. H.

doi:10.1007/s10772-019-09647-0

Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach

Published: 08 October 2019

Volume 22, pages 1021–1029, (2019)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

433 Accesses
16 Citations
Explore all metrics

Abstract

In this paper a modified Mel scaled filter bank-based approach to discriminate people suffering from Parkinson disease (PD) in their early stages from healthy people using speech samples is proposed. Parkinson’s disease not only affects the muscular activities of the human body but also affects the speech of the diseased. So, the speech features of Parkinson affected people tend to vary and hence differ from the speech features of healthy people. In this paper, the speech feature used for discriminating the two groups is the Mel frequency cepstral coefficients (MFCC) extracted from speech samples of both the PD and healthy people. The traditional way of computing the MFCC coefficients involves the design of the Mel filter bank. These filters are usually designed according to the auditory or acoustic system of human ear which follows the Mel scale. In this study, modification to this Mel scaled bank of filters is done by varying its bandwidth in the region of interest to compute the feature, MFCC and its performance is then compared with the conventionally designed MFCC filter bank for the said application. The performance is compared in terms of classification accuracy using radial basis network classifier. The results show an improvement of 6.3% in the classification accuracy obtained using the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parkinson’s Disease Recognition from Speech Signal Using Discrete Wavelet Transform, Delta, Delta-Delta, and K-Nearest Neighbor

A Review on Early Diagnosis of Parkinson’s Disease Using Speech Signal Parameters Based on Machine Learning Technique

Speech processing for early Parkinson’s disease diagnosis: machine learning and deep learning-based approach

Article 04 July 2022

References

Benba, A., Jilbab, A., & Hammouch, A. (2016). Discriminating between patients with Parkinson’s and neurological diseases using cepstral analysis. IEEE Transactions on Neural Systems and Rehabilitation Engineering,24(10), 1100–1108.
Article Google Scholar
Benba, A., Jilbab, A., Hammouch, A., & Sandabad, S. (2015). Voiceprints analysis using MFCC and SVM for detecting patients with Parkinson’s disease. In IEEE 1st international conference on electrical and information technologies ICEIT’2015 (pp. 300–304).
Braga, D., Madureira, A. M., Coelho, L., & Ajith, R. (2019). Automatic detection of Parkinson’s disease based on acoustic analysis of speech. Engineering Applications of Artificial Intelligence,77, 148–158.
Article Google Scholar
Do, M. N. (2016) An automatic speaker recognition system, Audio Visual Communications Laboratory, Swiss Federal Institute of Technology, Lausanne, Switzerland. Retrieved May, 2016, from http://lcavwww.epfl.ch/~minhdo/asr_project/.
Godino-Llorente, J. I., Gomez-Vilda, P., & Blanco-Velasco, M. (2006). Dimensionality reduction of a pathological voice quality assessment system based on gaussian mixture models and short-term cepstral parameters. IEEE Transactions on Biomedical Engineering,53(10), 1943–1953.
Article Google Scholar
Han, W., Chan, C. F., Choy, C. S., Pun, K. P. (2006). An efficient MFCC extraction method in speech recognition. In Proceedings of the IEEE international symposium on circuits and systems (ISCAS’2006) (pp. 145–148).
Hornykiewicz, G. O. (1998). Biochemical aspects of Parkinson’s disease”. Neurology,51, S2–S9.
Article Google Scholar
Kopparapu, S & Narayana, L (2010). Choice of Mel filter bank in computing MFCC of a resampled speech. In International conference on information science, signal processing and their applications (ISSPA).
Molau, S., Pitz, M., Schliitel, R., Ney, H. (2001). Computing Mel-frequency cepstral coefficients on the power spectrum”. In Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP’2001) (pp. 73–76).
Okan Sakar, C., Serbes, G., Gunduz, A., Tunc, H. C., Nizam, H., Sakar, B. E., et al. (2019). A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform. Applied Soft Computing Journal,74, 255–263.
Article Google Scholar
Orozco-Arroyave, J. R. et al. (2013). Perceptual analysis of speech signals from people with Parkinson’s disease. In IWINAC 2013, Part 1, LNCS 7930 (pp. 201–211). Berlin Heidelberg: Springer-Verlag.
Chapter Google Scholar
Oung, Q. W., Basah, S. N., Muthusamy, H., Vijean, V., Lee, H. (2017) Evaluation of short-term cepstral based features for detection of Parkinson’s Disease severity levels through speech signals. In MUCET 2017 IOP publishing IOP conference series: Materials science and engineering (p. 318).
Retrieved December, 2018, from https://mccormickml.com/2013/08/15/radial-basis-function-network-rbfn-tutorial.
Retrieved December, 2018, from https://www.saedsayad.com/artificial_neural_network_rbf.htm.
Retrieved May, 2015, from https://cmusphinx.sourceforge.net/sphinx4/javadoc/edu/cmu/sphinx/frontend/frequencywarp/MelFrequencyFilterBank.html.
Rusz, J., & Cmejla, R. (2011). Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson’s disease. Journal of the Acoustical Society of America,129(1), 350–367.
Article Google Scholar
Shahbakhi, M., Far, D. T., & Tahami, E. (2014). Speech analysis for diagnosis of Parkinson’s disease using genetic algorithm and support vector machine. Journal of Biomedical Science and Engineering,7, 147–156.
Article Google Scholar
Singh, S., & Xu, W. (2019). Robust detection of Parkinson’s Disease using harvested smartphone voice data: A telemedicine approach. Telemedicine and e-Health. https://doi.org/10.1089/tmj.2018.0271.
Article Google Scholar
Skowronski, M. & Harris, J. (2002). Increased MFCC filter bandwidth for noise-robust phoneme recognition. In IEEE international conference on acoustics, speech, and signal processing ICASSP 2002, (pp. 801–804).
Skowronski, M. & Harris, J. (2003). Improving the filter bank of classic speech feature extraction algorithm. In Proceedings of the 2003 international symposiumon circuits and systems ISCAS 2003, (pp. 281–284).
Tsanas, A., Little, M. A., McSharry, P. E., Spielman, J., & Ramig, L. O. (2012). Novel speech signal processing algorithms for high accuracy classification of Parkinson’s disease. IEEE Transactions on Biomedical Engineering,59(5), 1264–1271.
Article Google Scholar
Upadhya, S. S., Cheeran, A. N., & Nirmal, J. H. (2018a). Thomson multitaper MFCC and PLP voice features for early detection of Parkinson disease. Elsevier’s Biomedical Signal Processing and Control,46, 293–301.
Article Google Scholar
Upadhya, S. S., Cheeran, A. N., & Nirmal, J. H. (2018b). Multitaper perceptual linear prediction features of voice samples to discriminate healthy persons from early stage Parkinson diseased persons. International Journal Speech Technology,21(3), 391–399.
Article Google Scholar
Vignolo, L. D., Rufiner, H. L., Milone, D. H. (2009). Genetic optimization of cepstrum filterbank for phoneme classification. In Proceedings of the second international conference on bio-inspired systems and signal processing (BIOSIGNALS 2009), pp. 179-185.
Vizza, P., Tradigo, G., Mirarchi, D., Bossio, R. B., Lombardo, N., Arabia, G., et al. (2019). Methodologies of speech analysis for neurodegenerative diseases evaluation. International Journal of Medical Informatics,122, 45–54.
Article Google Scholar
Wrigley, S. N. (2015) Speech recognition by dynamic time warping, Speech and Hearing Research Group, University of Sheffield, Sheffield S1 4DP, United Kingdom. Retrieved March, 2015, from https://www.dcs.shef.ac.uk/~ stu/com326/sym.html.

Download references

Author information

Authors and Affiliations

Electronics and Telecommunication Engineering Department, Fr. C. Rodrigues Institute of Technology, Vashi, Navi Mumbai, Maharashtra, 400703, India
Savitha S. Upadhya
Electrical Engineering Department, Veermata Jijabai Technological Institute, Matunga, Mumbai, India
A. N. Cheeran
Electronics Engineering Department, K J Somaiya College of Engineering, Vidyavihar, Mumbai, Maharashtra, 400077, India
J. H. Nirmal

Authors

Savitha S. Upadhya
View author publications
You can also search for this author in PubMed Google Scholar
A. N. Cheeran
View author publications
You can also search for this author in PubMed Google Scholar
J. H. Nirmal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Savitha S. Upadhya.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Upadhya, S.S., Cheeran, A.N. & Nirmal, J.H. Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach. Int J Speech Technol 22, 1021–1029 (2019). https://doi.org/10.1007/s10772-019-09647-0

Download citation

Received: 19 June 2019
Accepted: 26 September 2019
Published: 08 October 2019
Issue Date: December 2019
DOI: https://doi.org/10.1007/s10772-019-09647-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach

Abstract

Access this article

Similar content being viewed by others

Parkinson’s Disease Recognition from Speech Signal Using Discrete Wavelet Transform, Delta, Delta-Delta, and K-Nearest Neighbor

A Review on Early Diagnosis of Parkinson’s Disease Using Speech Signal Parameters Based on Machine Learning Technique

Speech processing for early Parkinson’s disease diagnosis: machine learning and deep learning-based approach

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach

Abstract

Access this article

Similar content being viewed by others

Parkinson’s Disease Recognition from Speech Signal Using Discrete Wavelet Transform, Delta, Delta-Delta, and K-Nearest Neighbor

A Review on Early Diagnosis of Parkinson’s Disease Using Speech Signal Parameters Based on Machine Learning Technique

Speech processing for early Parkinson’s disease diagnosis: machine learning and deep learning-based approach

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation