Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach

International Journal of Speech Technology

Abstract

This paper proposes a modified Mel-scaled filter bank approach for discriminating people in the early stages of Parkinson’s disease (PD) from healthy people using speech samples. Parkinson’s disease affects not only the muscular activity of the body but also the speech of those affected, so the speech features of people with PD tend to differ from those of healthy people. The feature used here to discriminate the two groups is the set of Mel frequency cepstral coefficients (MFCC) extracted from speech samples of both PD and healthy speakers. Conventional MFCC computation involves designing a Mel filter bank whose filters follow the Mel scale, which models the frequency response of the human auditory system. In this study, the Mel-scaled filter bank is modified by varying the filter bandwidth in the region of interest before the MFCCs are computed, and its performance is compared with that of the conventionally designed MFCC filter bank for this application. Performance is measured as classification accuracy using a radial basis network classifier. The results show a 6.3% improvement in classification accuracy with the proposed method.
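To make the idea of varying filter bandwidth concrete, the following is a minimal sketch of a triangular Mel filter bank in which filters inside a chosen band can be widened or narrowed, followed by the usual log and DCT steps for one frame. It is not the authors' implementation: the abstract does not state how the bandwidth is varied or which band is the region of interest, so `bw_scale`, `roi_hz`, the function names, and the 2595/700 form of the Mel scale are all assumptions made for illustration.

```python
import numpy as np


def hz_to_mel(f_hz):
    """A commonly used form of the Mel scale (assumed here, not from the paper)."""
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filter_bank(n_filters, n_fft, sample_rate, bw_scale=1.0, roi_hz=None):
    """Triangular Mel filter bank of shape (n_filters, n_fft // 2 + 1).

    bw_scale and roi_hz are hypothetical parameters. Filters whose centre
    frequency lies inside roi_hz = (low, high) have both edges moved away
    from the centre (bw_scale > 1) or towards it (bw_scale < 1).
    """
    nyquist = sample_rate / 2.0
    # n_filters + 2 points equally spaced in Mel: filter edges and centres.
    points_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(nyquist), n_filters + 2))
    freqs = np.linspace(0.0, nyquist, n_fft // 2 + 1)  # FFT bin frequencies
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, centre, hi = points_hz[i], points_hz[i + 1], points_hz[i + 2]
        if roi_hz is not None and roi_hz[0] <= centre <= roi_hz[1]:
            lo = max(0.0, centre - bw_scale * (centre - lo))
            hi = min(nyquist, centre + bw_scale * (hi - centre))
        rising = (freqs - lo) / (centre - lo)
        falling = (hi - freqs) / (hi - centre)
        # Triangle: minimum of the two slopes, floored at zero.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def mfcc_frame(frame, bank, n_coeffs=13):
    """MFCCs of one windowed frame: power spectrum, filter bank, log, DCT-II."""
    n_fft = 2 * (bank.shape[1] - 1)
    power = np.abs(np.fft.rfft(frame, n=n_fft)) ** 2
    log_energy = np.log(bank @ power + 1e-10)  # small floor avoids log(0)
    n = bank.shape[0]
    k = np.arange(n_coeffs)[:, None]
    dct = np.cos(np.pi * k * (2 * np.arange(n)[None, :] + 1) / (2 * n))
    return dct @ log_energy
```

Calling `mel_filter_bank(20, 512, 16000)` gives a conventional bank, while passing, say, `bw_scale=1.5` and `roi_hz=(300.0, 3000.0)` widens only the filters centred in that band. The resulting per-frame coefficients could then be fed to any classifier, such as the radial basis network mentioned above.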

Author information

Corresponding author

Correspondence to Savitha S. Upadhya.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Upadhya, S.S., Cheeran, A.N. & Nirmal, J.H. Discriminating Parkinson diseased and healthy people using modified MFCC filter bank approach. Int J Speech Technol 22, 1021–1029 (2019). https://doi.org/10.1007/s10772-019-09647-0

  • DOI: https://doi.org/10.1007/s10772-019-09647-0
