Skip to main content

Advertisement

Log in

Machine learning for the diagnosis of Parkinson’s disease using speech analysis: a systematic review

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

As voice technology continues to advance, clinical analysis of acoustic measures in speech patterns has the potential for early detection of Parkinson’s disease (PD), the second most prevalent neurodegenerative disorder for humans globally. Often returning inconsistent findings when examined by a specialist, voice has been demonstrated to be a vital biomarker in diagnosis. Many models of early onset PD in humans have been developed based on speech patterns. We conducted a search on three databases and included studies that evaluated the performance of machine learning algorithms in predicting the Parkinson’s disease through speech analysis via area under the receiver operating characteristic curve (ROC AUC) values. Studies were manually assessed for risks of selection bias, performance bias, and of the eight studies selected, the model with the highest number of samples and best ROC AUC of 92.4% was Gradient Boosted. Whereas the model with the lowest number of samples had the lowest ROC AUC of 50% for all models in the study, including Bayes Net, Zero R, Naïve Bayes, and Random Forest. Gradient Boosted models seemed to have higher accuracy of predictions in most studies. We determined that some machine learning algorithms can accurately predict the Parkinson’s disease using voice data on specific datasets, showing potential for use in clinical settings. This allows for earlier detection of Parkinson’s disease and improvement of clinical decisions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Data Availability

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

References

  • Almeida, J. S., Rebouças Filho, P. P., Carneiro, T., Wei, W., Damaševičius, R., Maskeliūnas, R., & de Albuquerque, V. H. C. (2019). Detecting Parkinson’s Disease with sustained phonation and speech signals using machine learning techniques. Pattern Recognition Letters, 125, 55–62. https://doi.org/10.1016/j.patrec.2019.04.005

    Article  Google Scholar 

  • Bot, B. M., Suver, C., Neto, E. C., Kellen, M., Klein, A., Bare, C., Doerr, M., Pratap, A., Wilbanks, J., Dorsey, E. R., Friend, S. H., & Trister, A. D. (2016). The mPower study, Parkinson Disease mobile data collected using ResearchKit. Scientific Data, 3(1), https://doi.org/10.1038/sdata.2016.11

  • Carrón, J., Campos-Roca, Y., Madruga, M., & Pérez, C. J. (2021). A mobile-assisted voice condition analysis system for Parkinson’s Disease: Assessment of usability conditions. Biomedical Engineering Online, 20(1), https://doi.org/10.1186/s12938-021-00951-y

  • Deng, K., Li, Y., Zhang, H., Wang, J., Albin, R. L., & Guan, Y. (2022). Heterogeneous digital biomarker integration out-performs patient self-reports in predicting Parkinson’s Disease. Communications Biology, 5(1), https://doi.org/10.1038/s42003-022-03002-x

  • Dinesh, A., & He, J. (2017). Using machine learning to diagnose Parkinson’s disease from voice recordings. 2017 IEEE MIT Undergraduate Research Technology Conference (URTC).

  • Hireš, M., Gazda, M., Drotár, P., Pah, N. D., Motin, M. A., & Kumar, D. K. (2022). Convolutional neural network ensemble for Parkinson’s Disease detection from voice recordings. Computers in Biology and Medicine, 141(105021), 105021. https://doi.org/10.1016/j.compbiomed.2021.105021

    Article  Google Scholar 

  • Karabayir, I., Goldman, S. M., Pappu, S., & Akbilgic, O. (2020). Gradient boosting for Parkinson’s Disease diagnosis from voice recordings. BMC Medical Informatics and Decision Making, 20(1), https://doi.org/10.1186/s12911-020-01250-7

  • Little, M. A., McSharry, P. E., Roberts, S. J., Costello, D. A. E., & Moroz, I. M. (2007). Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomedical Engineering Online, 6(1), 23. https://doi.org/10.1186/1475-925x-6-23

    Article  Google Scholar 

  • Orozco-Arroyave, J. R., Arias-Londoño, J. D., Vargas-Bonilla, J. F., González-Rátiva, M. C., & Nöth, E. (2014). New Spanish speech corpus database for the analysis of people suffering from Parkinson ‘s disease. In Proceedings of the ninth international conference on language resources and evaluation (LREC’14) (pp. 342–347). ELRA.

  • Ouzzani, M., Hammady, H., Fedorowicz, Z., & Elmagarmid, A. (2016). Rayyan—a web and mobile app for systematic reviews. Systematic Reviews, 5(1), https://doi.org/10.1186/s13643-016-0384-4

  • Pützer, M., & Barry, W. J. (2007). Saarbruecken voice database [Data set]. https://www.stimmdatenbank.coli.uni-saarland.de/

  • Sujatha, J., & Rajagopalan, S. P. (2017). Performance evaluation of machine learning algorithms in the classification of Parkinson disease using voice attributes. International Journal of Applied Engineering Research, 12(21), 10669–10675.

    Google Scholar 

  • Valstar, M., Schuller, B., Smith, K., Eyben, F., Jiang, B., Bilakhia, S., Schnieder, S., Cowie, R., & Pantic, M. (2013). AVEC 2013: The continuous audio/visual emotion and depression recognition challenge. In Proceedings of the 3rd ACM international workshop on audio/visual emotion challenge.

  • Venegas, D. A. R. (2018). Dataset of vowels [Data set]. https://www.kaggle.com/datasets/darubiano57/dataset-of-vowels

  • Wroge, T. J., Ozkanca, Y., Demiroglu, C., Si, D., Atkins, D. C., & Ghomi, R. H. (2018). Parkinson’s disease diagnosis using machine learning and voice. In 2018 IEEE signal processing in medicine and biology symposium (SPMB).

Download references

Funding

None declared.

Author information

Authors and Affiliations

Authors

Contributions

C.B. contributed to the design and writing of the systematic review. N.B. contributed as the second reviewer to the writing of the systematic review and development of the quality scale of the studies included in the review. G.D. contributed as the third reviewer to the writing and final revision of the systematic review. O.M. contributed to the final revision of the systematic review. All authors have reviewed and approved the final version for publication and maintain accountability for all aspects of the article, including integrity and validity.

Corresponding author

Correspondence to Oge Marques.

Ethics declarations

Competing interests

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bang, C., Bogdanovic, N., Deutsch, G. et al. Machine learning for the diagnosis of Parkinson’s disease using speech analysis: a systematic review. Int J Speech Technol 26, 991–998 (2023). https://doi.org/10.1007/s10772-023-10070-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-023-10070-9

Keywords

Navigation