Skip to main content

Is There Any Additional Information in a Neural Network Trained for Pathological Speech Classification?

  • Conference paper
  • First Online:
Text, Speech, and Dialogue (TSD 2021)

Abstract

Speech is a biomarker extensively explored by the scientific community for different health-care applications because its reduced cost and non-intrusiveness. Specifically, in Parkinson’s disease, speech signals and deep learning methods have been explored for the automatic assessment and monitoring of patients. Related studies have shown to be very accurate to discriminate pathological vs. healthy speech. In spite of the high accuracies observed to detect the presence of diseases from speech, it is not clear which additional information about the speakers or the environment is implicitly learned by the deep learning systems. This study proposes a methodology to evaluate intermediate representations of a neural network in order to find out which other speaker traits and aspects are learned by the system during the training process. We trained models to detect the presence of Parkinson’s disease from speech. Then, we used intermediate representations of the network to classify additional speaker traits such as gender, age, and the native language. It is important to detect which information is available inside the neural network that can lead to open the black-box and to detect possible algorithmic biases. The results indicate that the network, in addition to adjusting its parameters for disease classification, also acquires knowledge about gender of the speakers in the first layers, and about speech tasks and the native language in the last layers of the network.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arias-Vergara, T., et al.: Automatic detection of voice onset time in voiceless plosives using gated recurrent units. Digital Signal Process. 104, 102779 (2020)

    Google Scholar 

  2. Bocklet, T., et al.: Automatic evaluation of parkinson’s speech-acoustic, prosodic and voice related cues. In: Proceedings of INTERSPEECH, pp. 1149–1153 (2013)

    Google Scholar 

  3. Caliskan, A., et al.: Diagnosis of the parkinson disease by using deep neural network classifier. Istanbul University J. Electr. Electron. Eng. 17(2), 3311–3318 (2017)

    MathSciNet  Google Scholar 

  4. Goetz, C.G., et al.: Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): Scale presentation and clinimetric testing results. Mov. Disord. 23(15), 2129–2170 (2008)

    Article  Google Scholar 

  5. Grósz, T., et al.: Assessing the degree of nativeness and parkinson’s condition using gaussian processes and deep rectifier neural networks. In: Proceedings of INTERSPEECH (2015)

    Google Scholar 

  6. He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38

    Chapter  Google Scholar 

  7. Jankovic, J.: Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurge. Psychiatry 79(4), 368–376 (2008)

    Article  Google Scholar 

  8. Mallela, J., et al.: Voice based classification of patients with amyotrophic lateral sclerosis, parkinson’s disease and healthy controls with CNN-LSTM using transfer learning. In: Proceedings of ICASSP, pp. 6784–6788. IEEE (2020)

    Google Scholar 

  9. McKinlay, A., et al.: A profile of neuropsychiatric problems and their relationship to quality of life for parkinson’s disease patients without dementia. Parkinsonism Related Disorders 14(1), 37–42 (2008)

    Article  Google Scholar 

  10. Novotný, M., et al.: Glottal source analysis of voice deficits in newly diagnosed drug-naïve patients with parkinson’s disease: Correlation between acoustic speech characteristics and non-speech motor performance. Biomed. Signal Process. Control 57, 101818 (2020)

    Article  Google Scholar 

  11. Orozco-Arroyave, J.R., et al.: New Spanish speech corpus database for the analysis of people suffering from parkinson’s disease. In: Proceedings of LREC, pp. 342–347 (2014)

    Google Scholar 

  12. Orozco-Arroyave, J.R., et al.: Apkinson: the smartphone application for telemonitoring parkinson’s patients through speech, gait and hands movement. Neurodegenerative Dis. Manage. 10(3), 137–157 (2020)

    Google Scholar 

  13. Rios-Urrego, C.D., Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Nöth, E.: Transfer learning to detect parkinson’s disease from speech in different languages using convolutional neural networks with layer freezing. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds.) TSD 2020. LNCS (LNAI), vol. 12284, pp. 331–339. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58323-1_36

    Chapter  Google Scholar 

  14. Rizvi, D.R., et al.: An LSTM based deep learning model for voice-based detection of parkinson’s disease. Int. J. Adv. Sci. Technol. 29(5), 8 (2020)

    MathSciNet  Google Scholar 

  15. Rusz, J.: Detecting speech disorders in early Parkinson’s disease by acoustic analysis. Habilitation thesis, Czech Technical University in Prague (2018)

    Google Scholar 

  16. Spencer, K.A., Rogers, M.A.: Speech motor programming in hypokinetic and ataxic dysarthria. Brain Lang. 94(3), 347–366 (2005)

    Article  Google Scholar 

  17. Thompson, J.A., et al.: How transferable are features in convolutional neural network acoustic models across languages? In: Proceedings of ICASSP, pp. 2827–2831. IEEE (2019)

    Google Scholar 

  18. Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Nöth, E.: Convolutional neural network to model articulation impairments in patients with Parkinson’s disease. In: Proceedings of INTERSPEECH, pp. 314–318 (2017)

    Google Scholar 

  19. Vavrek, L., et al.: Deep convolutional neural network for detection of pathological speech. In: Proceedings of SAMI, pp. 000245–000250. IEEE (2021)

    Google Scholar 

  20. Wodzinski, M., et al.: Deep learning approach to parkinson’s disease detection using voice recordings and convolutional neural network dedicated to image classification. In: Proceedings of EMBC, pp. 717–720. IEEE (2019)

    Google Scholar 

  21. Wu, H., et al.: Convolutional neural networks for pathological voice detection. In: Proceedings of EMBC, pp. 1–4. IEEE (2018)

    Google Scholar 

Download references

Acknowledgments

The work reported here was financed by CODI from University of Antioquia by grant Number 2017-15530. This project has received funding from the European Unions Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Grant Agreement No. 766287.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to C. D. Rios-Urrego .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rios-Urrego, C.D., Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Nöth, E. (2021). Is There Any Additional Information in a Neural Network Trained for Pathological Speech Classification?. In: Ekštein, K., Pártl, F., Konopík, M. (eds) Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science(), vol 12848. Springer, Cham. https://doi.org/10.1007/978-3-030-83527-9_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-83527-9_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-83526-2

  • Online ISBN: 978-3-030-83527-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics