Skip to main content

Advertisement

Log in

A speech based diagnostic method for Alzheimer disease using machine learning

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

Alzheimer’s Disease (AD), the most common form of dementia, is a neurodegenerative condition which evolves over time. Patients experience short-term memory lapses at the start of this process, and by the end, they depend exclusively on signs. While there is no cure for AD, Medical treatments can be pursued for premature patients. Individuals suffering from early-stage Alzheimer’s disease have signs of linguistic impairment, including issues with word recall and word seeking. In this paper, we use stacking, an ensemble Machine Learning algorithm that learns how to combine the best predictions from multiple high-performing Machine Learning models, such as the K-Nearest Neighbors algorithm, Random Forest algorithm, Decision Tree algorithm, Support Vector Machine algorithm and Multi-Layer Perceptron algorithm, to accurately predict AD. The data-set is gathered from the dementia Pitt corpus repository. Our methodology focuses on utilizing extracted features from speech signals to detect abnormalities and deduce whether a patient has AD. 78% of AD sufferers and Healthy participants can be distinguished using the proposed technique. The average test accuracy using stacking classifier is 64%. Machine Learning algorithms employ 13 dimensionality reduction technique and an average accuracy reported by Neighborhood Component Analysis is 73%. 97% accuracy was attained using stacking classifier in machine learning algorithms. This data set demonstrates the ability to detect AD using only audio information.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Data availability

Publicly available datasets were analyzed in this study. The data can be found at https://dementia.talkbank.org.

References

  • Ammar, R. B., & Ayed, Y. B. (2020). Language-related features for early detection of Alzheimer disease. Procedia Computer Science, 176, 763–770.

    Article  Google Scholar 

  • Aouragh, A. A., Bahaj, M. & Gherabi, N., (2022). Comparative study of dimensionality reduction techniques and machine learning algorithms for Alzheimer's disease classification and prediction. In 2022 IEEE 3rd international conference on electronics, control, optimization and computer science (ICECOCS) (pp. 1–6).

  • Campbell, E. L., Fernández, L. D., Raboso, J. J., & García-Mateo, C. (2021). Alzheimer's dementia detection from audio and language modalities in spontaneous speech. IberSPEECH, 24–25 March 2021.

  • Dehak, N., Dumouchel, P., & Kenny, P. (2007). Modeling prosodic features with joint factor analysis for speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 15(7), 2095–2103.

  • Grinberg, Y., Al-Digeil, M., Dezfouli, M. K., Melati, D., Schmid, J. H., Cheben, P., Janz, S., & Xu, D. (2022). Nonlinear dimensionality reduction for low data regimes in photonics design. Photonics North (PN), 12, 23–56. https://doi.org/10.1109/PN56061.2022.9908251

    Article  Google Scholar 

  • Haulcy, R., & Glass, J. (2021). Classifying Alzheimer’s disease using audio and text-based representations of speech. Frontiers in Psychology, 11, 624137. https://doi.org/10.3389/fpsyg.2020.624137

    Article  Google Scholar 

  • Ivanova, O., & Meilán, J. J. G. (2022). Speech analysis in preclinical identification of Alzheimer’s disease. In J. M. Ferrández Vicente, J. R. Álvarez-Sánchez, F. de la Paz López, & H. Adeli (Eds.), Artificial Intelligence in neuroscience: Affective analysis and health applications. (Vol. 13258). Springer.

    Google Scholar 

  • Joshy, A. A., & Rajan, R. (2022). Automated Dysarthria severity classification: A study on acoustic features and deep learning techniques. IEEE Transactions on Neural Systems and Rehabilitation Engineering: A Publication of the IEEE Engineering in Medicine and Biology Society, 30, 1147–1157.

    Article  Google Scholar 

  • Kathleen, F. R., Fraser, C., & Meltzer, J. A. (2016). Linguistic features identify Alzheimer’s disease in narrative speech. ISSN Journal of Alzheimer’s Disease, 49, 407–422.

    Google Scholar 

  • Kumar, Y., Maheshwari, P., Joshi, S., & Baths, V. (2022). ML-based analysis to identify speech features relevant in predicting Alzheimer's disease. ARXIV computer science and machine learning (2021). arXiv:2110.13023 [cs.LG]

  • Lao, H., & Zhang, X. (2022). Regression and classification of Alzheimer’s disease diagnosis using NMF-TDNet features from 3D brain MR image. IEEE Journal of Biomedical and Health Informatics, 26(3), 1103–1115. https://doi.org/10.1109/JBHI.2021.3113668

    Article  Google Scholar 

  • Liu, X., Tosun, D., Weiner, M. W., & Schuff, N. (2013). Locally linear embedding (LLE) for MRI based Alzheimer’s disease classification. NeuroImage, 83, 148–157. https://doi.org/10.1016/j.neuroimage.2013.06.033

    Article  Google Scholar 

  • Liu, Z., Guo, Z., Ling, Z., & Li, Y. (2021). Detecting Alzheimer’s disease from speech using neural networks with bottleneck features and data augmentation. In ICASSP 2021–2021 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 7323–7327).

  • López, M. M., Ramírez, J., Górriz, J. M., Álvarez, I., Salas-Gonzalez, D., Segovia, F., & Chaves, R. (2009). SVM-based CAD system for early detection of the Alzheimer’s disease using kernel PCA and LDA. Neuroscience Letters, 464(3), 233–238. https://doi.org/10.1016/j.neulet.2009.08.061

    Article  Google Scholar 

  • Luz, S., Haider, F., de la Fuente, S., Fromm, D., & MacWhinney, B. (2021). Detecting cognitive decline using speech only adresso challenge: The adresso challenge. Proceedings Interspeech. https://doi.org/10.21437/Interspeech.2021-1220

    Article  Google Scholar 

  • Nasution, M. Z. F., Sitompul, O. S., & Ramli, M. (2018). PCA based feature reduction to improve the accuracy of decision tree classification. Journal of Physics: Conference Series, 978(1), 012058.

    Google Scholar 

  • Pastoriza-Dominguez, P., Torre, I. G., Dieguez-Vide, F., Gómez-Ruiz, I., Geladó, S., Bello-López, J., & Hernández-Fernández, A. (2022). Speech pause distribution as an early marker for Alzheimer’s disease. Speech Communication, 136, 107–117.

    Article  Google Scholar 

  • Pei, Z. H., & Shen, Q. (2017). Local linear dimensionality reduction algorithm based on nonlinear manifolds decomposition. In 2017 international conference on network and information systems for computers (ICNISC) (pp. 126–130).

  • Roozbeh S., Stephen, A., Zahorian, J. D., & Schaffen, J. D. (2017). Speech processing approach for diagnosing dementia in an early stage. INTERSPEECH August 20–24, 2017

  • Segovia, F., Górriz, J. M., Ramírez, J., Martinez-Murcia, F. J., & García-Pérez, M. (2018). Using deep neural networks along with dimensionality reduction techniques to assist the diagnosis of neurodegenerative disorders. Logic Journal of the IGPL, 26(6), 618–628.

    MathSciNet  Google Scholar 

  • Shellikeri, S., Cho, S., Cousins, K., Liberman, M., Howard, E., Balganorth, Y., Weintraub, D., Spindler, M., Deik, A., Lee, E. B., Trojanowski, J. Q., Irwin, D., Wolk, D., Grossman, M., & Nevler, N. (2022). Natural speech markers of Alzheimer’s disease co-pathology in Lewy body dementias. Parkinsonism & Related Disorders, 102, 94–100.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Benazir Begam.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare. All co-authors have seen and agree with the contents of the manuscript and there is no financial interest to report. We certify that the submission is original work and is not under review at any other publication.

Additional information

Publisher’s Note

Springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Begam, R.B., Palanivelan, M. A speech based diagnostic method for Alzheimer disease using machine learning. Int J Speech Technol 26, 859–867 (2023). https://doi.org/10.1007/s10772-023-10056-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-023-10056-7

Keywords

Navigation