Skip to main content
Log in

Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson’s disease and healthy people

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

In this study, we wanted to discriminate between two groups of people. The database used in this study contains 20 patients with Parkinson’s disease (PD) and 20 healthy people. Three types of sustained vowels (/a/, /o/ and /u/) were recorded from each participant and then the analyses were done on these voice samples. The technique used in this study is to extract voiceprint from each voice samples by using mel frequency cepstral coefficients (MFCCs). The extracted MFCC were compressed by calculating their average value in order to extract the voiceprint from each voice recording. Subsequently, a classification method was performed using leave one subject out (LOSO) validation scheme along with support vector machines (SVMs). We also used an independent test to validate our results by using another database which contains 28 PD patients. Based on the research result, the best obtained classification accuracy using LOSO on the first dataset was 82.50 % using MLP kernel of SVM on sustained vowel /u/. And the maximum classification accuracy using the independent test was 100 % using sustained vowel /a/ with polynomial kernel of the SVM and with MLP kernel of the SVM. This result was also achieved using sustained vowel /o/ with polynomial kernel of the SVM.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Benba, A., Jilbab, A., & Hammouch, A. (2014a). Voice analysis for detecting persons with Parkinson’s disease using MFCC and VQ. In The 2014 international conference on circuits, systems and signal processing, 23–25 September 2014. Saint Petersburg: Saint Petersburg State Polytechnic University.

  • Benba, A., Jilbab, A., & Hammouch, A. (2014b). Hybridization of best acoustic cues for detecting persons with Parkinson’s disease. In 2nd World conference on complex system (WCCS14), 10–12 November 2014. Agadir: IEEE.

  • Benba, A., Jilbab, A., & Hammouch, A. (2014c). Voiceprint analysis using perceptual linear prediction and support vector machines for detecting persons with Parkinson’s disease. In The 3rd International conference on health science and biomedical systems (HSBS14), Florence, Italy, 22–24 November 2014.

  • Benba, A., Jilbab, A., & Hammouch, A. (2014d). Voice analysis for detecting persons with Parkinson’s disease using PLP and VQ. Journal of Theoretical and Applied Information Technology, 70(3), 443–450.

  • Benba, A., et al. (2015a). Voiceprints analysis using MFCC and SVM for detecting patients with Parkinson’s disease. In 2015 International conference on electrical and information technologies (ICEIT). IEEE, Marrakech, Morocco.

  • Benba, A., Jilbab, A., & Hammouch, A. (2015b). Detecting patients with Parkinson’s disease using PLP and VQ. In The 7th international conference on information technology, Amman, Jordan, 12–15 May 2015.

  • Benba, A., Jilbab, A., & Hammouch, A. (2015c). Detecting patients with Parkinson’s disease using mel frequency cepstral coefficients and support vector machines. International Journal on Electrical Engineering and Informatics, 7(2), 297.

    Article  Google Scholar 

  • Benba, A., Jilbab, A., & Hammouch, A. Discriminating between patients with Parkinson’s and neurological diseases using cepstral analysis. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 99, 1–1. doi: 10.1109/TNSRE.2016.2533582.

  • Frail, R., Godino-Llorente, J. I., Saenz-Lechon, N., Osma-Ruiz, V., & Fredouille, C. (2009). MFCC-based remote pathology detection on speech transmitted through the telephone channel. In Proceedings of biosignals, Porto, 2009.

  • Huse, D. M., et al. (2005). Burden of illness in Parkinson’s disease. Movement Disorders, 20(11), 1449–1454.

    Article  Google Scholar 

  • Ishihara, L., & Brayne, C. (2006). A systematic review of depression and mental illness preceding Parkinson’s disease. Acta Neurologica Scandinavica, 113(4), 211–220.

    Article  Google Scholar 

  • Jafari, A. (2013). Classification of Parkinson’s disease patients using nonlinear phonetic features and Mel-frequency cepstral analysis. Biomedical Engineering: Applications, Basis and Communications, 52(4). http://www.worldscientific.com.

  • Jankovic, J. (2008). Parkinson’s disease: Clinical features and diagnosis. Journal of Neurology, Neurosurgery and Psychiatry, 79(4), 368–376.

    Article  Google Scholar 

  • Kapoor, T., & Sharma, R. K. (2011). Parkinson’s disease diagnosis using Mel-frequency cepstral coefficients and vector quantization. International Journal of Computer Application, 14(3). doi:10.5120/1821-2393

  • Kumar, Ch. S., & Mallikarjuna, P. R. (2011). Design of an automatic speaker recognition system using MFCC, vector quantization and LBG algorithm. International Journal on Computer Science and Engineering, 3(8), 2942–2954.

    Google Scholar 

  • Little, M. A., McSharry, P. E., Roberts, S. J., Costello, D. A., & Moroz, I. M. (2007). Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomedical Engineering Online, 6, 23.

    Article  Google Scholar 

  • Little, M. A., et al. (2009). Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Transactions on Biomedical Engineering, 56(4), 1015–1022.

    Article  Google Scholar 

  • Martinez, J., Perez, H., Escamilla, E., & Suzuki, M. M. (2012). Speaker recognition using mel frequency cepstral coefficients (MFCC) and vector quantization (VQ) techniques. In IEEE electrical communications and computers, Cholula, Puebla, February 2012 (pp. 248–251).

  • O’Sullivan, S. B., & Schmitz, T. J. (2007). Parkinson disease. In Physical rehabilitation (5th ed., pp. 856–894). Philadelphia, PA: F. A. Davis Company.

  • Rahn, D. A., Chou, M., Jiang, J. J., & Zhang, Y. (2007). Phonatory impairment in Parkinson’s disease: Evidence from nonlinear dynamic analysis and perturbation analysis. Journal of Voice, 21, 64–71.

    Article  Google Scholar 

  • Rani, U. k., & Holi, M. S. (2013). Automatic detection of neurological disordered voices using mel cepstral coefficients and neural networks. In 2013 IEEE point-of-care healthcare technologies (PHT), Bangalore, India, 16–18 January 2013.

  • Sakar, B. E., et al. (2013). Collection and analysis of a Parkinson speech dataset with multiple types of sound recordings. IEEE Journal of Biomedical and Health Informatics, 17(4), 828–834.

    Article  Google Scholar 

  • Yang, Z. R., et al. (2005). RONN: The bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics, 21(16), 3369–3376.

    Article  Google Scholar 

  • Young, S., Evermann, G., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P. (2001–2006). The HTK book (for HTK version 3.4). Cambridge: Cambridge University Engineering Department.

Download references

Acknowledgments

The authors would like to thank Mr. Erdem Isenkul from Department of Computer Engineering at Istanbul University, Mr. Thomas R. Przybeck and Mr. Daniel Wood from United States Peace Corps Volunteers (Morocco 2013–2015), and all of the participants involved in the dataset collection process.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Achraf Benba.

Ethics declarations

Conflict of interest

Achraf Benba declares that he has no conflict of interest. And he doesn’t have any financial relationship with the organization. Abdelilah Jilbab declares that he has no conflict of interest. And he doesn’t have any financial relationship with the organization. Ahmed Hammouch declares that he has no conflict of interest. And he doesn’t have any financial relationship with the organization. All procedures followed were in accordance with the ethical standards of the Responsible Committee on Human Experimentation (Institutional and National) and with the Helsinki Declaration of 1975, as revised in 2008 (5).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Benba, A., Jilbab, A. & Hammouch, A. Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson’s disease and healthy people. Int J Speech Technol 19, 449–456 (2016). https://doi.org/10.1007/s10772-016-9338-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-016-9338-4

Keywords

Navigation