Abstract
This paper proposes a modified Mel-scaled filter bank approach that uses speech samples to discriminate people in the early stages of Parkinson's disease (PD) from healthy people. Parkinson's disease affects not only the muscular activity of the body but also the speech of those who have it, so the speech features of people with PD tend to differ from those of healthy people. The feature used here to discriminate the two groups is the set of Mel frequency cepstral coefficients (MFCC) extracted from speech samples of both PD patients and healthy people. Conventional MFCC computation relies on a Mel filter bank whose filters are designed to follow the Mel scale, which approximates how the human auditory system perceives pitch. In this study, the Mel-scaled filter bank is modified by varying the filter bandwidth in the region of interest before the MFCC are computed, and its performance is compared with that of the conventionally designed MFCC filter bank for the same application. Performance is measured as classification accuracy using a radial basis network classifier. The results show a 6.3% improvement in classification accuracy with the proposed method.
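The idea of widening Mel filters in a region of interest can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the filter count, the scaling rule, or the region of interest, so `bandwidth_scale`, `roi_hz`, the 26-filter default, and both function names are assumptions chosen for illustration. The sketch builds triangular filters with centers equally spaced on the Mel scale, stretches the edges of filters whose centers fall inside the assumed region, and derives cepstral coefficients from the log filter energies with an unnormalized DCT-II.

```python
import numpy as np

def hz_to_mel(f_hz):
    # Common Mel-scale formula (O'Shaughnessy form).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(n_filters=26, n_fft=512, sample_rate=16000,
                   bandwidth_scale=1.0, roi_hz=None):
    """Triangular Mel filter bank, shape (n_filters, n_fft // 2 + 1).

    bandwidth_scale and roi_hz are hypothetical parameters: filters whose
    center frequency lies inside roi_hz = (low, high) have both edges moved
    away from (scale > 1) or toward (scale < 1) the center on the Mel axis.
    """
    nyquist = sample_rate / 2.0
    # n_filters + 2 equally spaced Mel points: lower edge, centers, upper edge.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(nyquist), n_filters + 2)
    fft_freqs = np.linspace(0.0, nyquist, n_fft // 2 + 1)
    bank = np.zeros((n_filters, fft_freqs.size))
    for i in range(n_filters):
        lo, center, hi = mel_points[i], mel_points[i + 1], mel_points[i + 2]
        center_hz = float(mel_to_hz(center))
        if roi_hz is not None and roi_hz[0] <= center_hz <= roi_hz[1]:
            lo = center - bandwidth_scale * (center - lo)
            hi = center + bandwidth_scale * (hi - center)
        lo_hz, hi_hz = float(mel_to_hz(lo)), float(mel_to_hz(hi))
        # Triangle rising from lo_hz to center_hz, then falling to hi_hz.
        rising = (fft_freqs - lo_hz) / (center_hz - lo_hz)
        falling = (hi_hz - fft_freqs) / (hi_hz - center_hz)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def mfcc_from_power(power_spectrum, bank, n_coeffs=13):
    """Cepstral coefficients for one frame: log filter energies, then DCT-II."""
    energies = np.maximum(bank @ power_spectrum, 1e-10)  # floor avoids log(0)
    log_energies = np.log(energies)
    n = log_energies.size
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (np.arange(n) + 0.5) / n)  # unnormalized DCT-II
    return basis @ log_energies

# Conventional bank versus a bank widened by 50% between 300 Hz and 3 kHz
# (both values are placeholders, not taken from the paper).
conventional = mel_filterbank()
modified = mel_filterbank(bandwidth_scale=1.5, roi_hz=(300.0, 3000.0))
```

Passing the same frame's power spectrum through `mfcc_from_power` with each bank yields two feature vectors that could then be fed to a classifier for comparison, which is the kind of experiment the abstract describes.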
Cite this article
Upadhya, S.S., Cheeran, A.N. & Nirmal, J.H. Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach. Int J Speech Technol 22, 1021–1029 (2019). https://doi.org/10.1007/s10772-019-09647-0