ISCA Archive Interspeech 2022

Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings

Paula Andrea Pérez-Toro, Philipp Klumpp, Abner Hernandez, Tomas Arias, Patricia Lillo, Andrea Slachevsky, Adolfo Martín García, Maria Schuster, Andreas K. Maier, Elmar Noeth, Juan Rafael Orozco-Arroyave

Cross-lingual approaches are growing in popularity in machine learning, where large amounts of data are required to generalize well. A major obstacle is the limited availability of clinical speech data, most of which is in English. For instance, few Alzheimer's Disease (AD) corpora in other languages can be found in the literature. Despite the phonological and phonemic differences between Spanish and English, the two languages also share similarities, e.g., around 40% of all words in English have a related word in Spanish. In this work, we investigate the feasibility of combining information from English and Spanish to discriminate AD. Two datasets were considered: part of the Pitt Corpus, which is composed of English speakers, and a Spanish AD dataset composed of speakers from Chile. We based our analysis on well-known acoustic (Wav2Vec) and word (BERT, RoBERTa) embeddings, using different classifiers. Strong language dependencies were found, even when using multilingual representations. We observed that linguistic information was more important for classifying English AD (F-score = 0.76) and acoustic information for Spanish AD (F-score = 0.80). Transferring knowledge from English to Spanish achieved F-scores of up to 0.85 for discriminating AD.
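For readers unfamiliar with the reported metric, the following is a minimal sketch of how an F-score can be computed for a binary AD-versus-non-AD decision. The abstract does not state which F-score variant or averaging scheme was used, so the positive-class F1 shown here is an assumption. The function name `f1_score_binary` and the toy labels are hypothetical and are not taken from the paper.

```python
def f1_score_binary(y_true, y_pred, positive=1):
    """Positive-class F1: harmonic mean of precision and recall.

    Assumption: binary labels, with `positive` marking the AD class.
    This is an illustration only, not the authors' evaluation code.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels (hypothetical): 1 = AD, 0 = control.
toy_true = [1, 1, 1, 0, 0]
toy_pred = [1, 1, 0, 1, 0]
toy_f1 = f1_score_binary(toy_true, toy_pred)
```

In this toy case there are two true positives, one false positive, and one false negative, so precision and recall are both 2/3 and the F-score is also 2/3.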


doi: 10.21437/Interspeech.2022-10883

Cite as: Pérez-Toro, P.A., Klumpp, P., Hernandez, A., Arias, T., Lillo, P., Slachevsky, A., García, A.M., Schuster, M., Maier, A.K., Noeth, E., Orozco-Arroyave, J.R. (2022) Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings. Proc. Interspeech 2022, 2483-2487, doi: 10.21437/Interspeech.2022-10883

@inproceedings{pereztoro22_interspeech,
  author={Paula Andrea Pérez-Toro and Philipp Klumpp and Abner Hernandez and Tomas Arias and Patricia Lillo and Andrea Slachevsky and Adolfo Martín García and Maria Schuster and Andreas K. Maier and Elmar Noeth and Juan Rafael Orozco-Arroyave},
  title={{Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings}},
  year=2022,
  booktitle={Proc. Interspeech 2022},
  pages={2483--2487},
  doi={10.21437/Interspeech.2022-10883}
}