Skip to main content

Evaluation of Depression Severity in Speech

  • Conference paper
  • First Online:
Brain Informatics and Health (BIH 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9919))

Included in the following conference series:

  • 1545 Accesses

Abstract

Depression is a frequent affective disorder, leading to a high impact on patients, their families and society. Depression diagnosis is limited by assessment methods that rely on patient-reported or clinician judgments of symptom severity. Recently, many researches showed that voice is an objective indicator for depressive diagnosis. In this paper, we investigate a sample of 111 subjects (38 healthy controls, 36 mild depressed patients and 37 severe depressed patients) through comparative analysis to explore the correlation between acoustic features and depression severity. We extract features as many as possible according to previous researches to create a large voice feature set. Then we employ some feature selection methods to form compact subsets on different tasks. Finally, we evaluate depressive disorder severity by these acoustic feature subsets. Results show that interview is a better choice than reading and picture description for depression assessment. Meanwhile, speech signal correlate to depression severity in a medium-level with statistically significant (p < 0.01).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Lecrubier, Y.: Depressive illness and disability. Eur. Neuropsychopharmacol. 10, S439–S443 (2000)

    Article  Google Scholar 

  2. World Health Organization. http://www.who.int/mediacentre/factsheets/fs396/en/

  3. Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)

    Article  Google Scholar 

  4. Zung, W.W., Richards, C.B., Short, M.J.: Self-rating depression scale in an outpatient clinic: further validation of the SDS. Arch. Gen. Psychiatry 13, 508–515 (1965)

    Article  Google Scholar 

  5. American Psychiatric Association: DSM-III-R: Diagnostic and Statistical Manual of Mental Disorders. American Psychiatric Association, Arlington (1980)

    Google Scholar 

  6. Horwitz, R., Quatieri, T.F., Helfer, B.S., Yu, B., Williamson, J.R., Mundt, J.: On the relative importance of vocal source, system, and prosody in human depression. In: 2013 IEEE International Conference on Body Sensor Networks (BSN), pp. 1–6. IEEE (2013)

    Google Scholar 

  7. Moore, E., Clements, M., Peifer, J., Weisser, L.: Analysis of prosodic variation in speech for clinical depression. In: Proceedings of the 25th Annual International Conference of the IEEE on Engineering in Medicine and Biology Society, pp. 2925–2928. IEEE (2003)

    Google Scholar 

  8. France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, D.M.: Acoustical properties of speech as indicators of depression and suicidal risk. IEEE Trans. Biomed. Eng. 47, 829–837 (2000)

    Article  Google Scholar 

  9. Quatieri, T.F., Malyska, N.: Vocal-source biomarkers for depression: a link to psychomotor activity. In: Interspeech, pp. 1059–1062

    Google Scholar 

  10. Vicsi, K., Sztaho, D., Kiss, G.: Examination of the sensitivity of acoustic-phonetic parameters of speech to depression. In: 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom), pp. 511–515. IEEE (2012)

    Google Scholar 

  11. Harel, B., Cannizzaro, M., Snyder, P.J.: Variability in fundamental frequency during speech in prodromal and incipient Parkinson’s disease: a longitudinal case study. Brain Cogn. 56, 24–29 (2004)

    Article  Google Scholar 

  12. Mundt, J.C., Vogel, A.P., Feltner, D.E., Lenderking, W.R.: Vocal acoustic biomarkers of depression severity and treatment response. Biol. Psychiatry 72, 580–587 (2012)

    Article  Google Scholar 

  13. Nilsonne, Å., Sundberg, J., Ternström, S., Askenfelt, A.: Measuring the rate of change of voice fundamental frequency in fluent speech during mental depression. J. Acoust. Soc. Am. 83, 716–728 (1988)

    Article  Google Scholar 

  14. Scripture, E.: A study of emotions by speech transcription. Vox 31, 179–183 (1921)

    Google Scholar 

  15. Ooi, K.E.B., Low, L.-S.A., Lech, M., Allen, N.: Early prediction of major depression in adolescents using glottal wave characteristics and teager energy parameters. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4613–4616. IEEE (2012)

    Google Scholar 

  16. Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., Parker, G.: Detecting depression: a comparison between spontaneous and read speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 7547–7551. IEEE (2013)

    Google Scholar 

  17. Kroencke, K., Spitzer, R., Williams, J.: The phq-9: validity of a brief depression severity measure [electronic version]. J. Gen. Intern. Med. 16, 606–613 (2001)

    Article  Google Scholar 

  18. Hönig, F., Batliner, A., Nöth, E., Schnieder, S., Krajewski, J.: Automatic modelling of depressed speech: relevant features and relevance of gender. In: INTERSPEECH, pp. 1248–1252

    Google Scholar 

  19. DUTIR. http://ir.dlut.edu.cn/Group.aspx?ID=4

  20. ShuJuTang. http://www.datatang.com/data/43216

  21. Bandura, A., Pastorelli, C., Barbaranelli, C., Caprara, G.V.: Self-efficacy pathways to childhood depression. J. Pers. Soc. Psychol. 76, 258 (1999)

    Article  Google Scholar 

  22. Zhou, G., Hansen, J.H., Kaiser, J.F.: Nonlinear feature based classification of speech under stress. IEEE Trans. Speech Audio Process. 9, 201–216 (2001)

    Article  Google Scholar 

  23. Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., Parker, G.: A comparative study of different classifiers for detecting depression from spontaneous speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8022–8026. IEEE (2013)

    Google Scholar 

  24. Kurniawan, H., Maslov, A.V., Pechenizkiy, M.: Stress detection from speech and galvanic skin response signals. In: 2013 IEEE 26th International Symposium on Computer-Based Medical Systems (CBMS), pp. 209–214. IEEE (2013)

    Google Scholar 

  25. De Jong, N.H., Wempe, T.: Praat script speech rate (2008). Accessed 14 Oct 2008

    Google Scholar 

  26. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)

    MATH  Google Scholar 

  27. Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1226–1238 (2005)

    Article  Google Scholar 

  28. Pudil, P., Novovičová, J., Kittler, J.: Floating search methods in feature selection. Pattern Recogn. Lett. 15, 1119–1125 (1994)

    Article  Google Scholar 

  29. Mitchell, T.M.: Machine Learning. WCB/McGraw-Hill, Boston (1997)

    MATH  Google Scholar 

  30. Quinlan, J.R.: C4. 5: programs for machine learning. Elsevier (2014)

    Google Scholar 

  31. Hsu, C.-W., Lin, C.-J.: A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13, 415–425 (2002)

    Article  Google Scholar 

  32. Farzanfar, R., Hereen, T., Fava, J., Davis, J., Vachon, L., Friedman, R.: Psychometric properties of an automated telephone-based PHQ-9. Telemed. e-Health 20, 115–121 (2014)

    Article  Google Scholar 

  33. Spitzer, R.L., Kroenke, K., Williams, J.B., Patient Health Questionnaire Primary Care Study Group: Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. JAMA 282, 1737–1744 (1999)

    Article  Google Scholar 

  34. Löwe, B., Unützer, J., Callahan, C.M., Perkins, A.J., Kroenke, K.: Monitoring depression treatment outcomes with the patient health questionnaire-9. Med. Care 42, 1194–1201 (2004)

    Article  Google Scholar 

Download references

Acknowledgment

This work was supported by the National Basic Research Program of China (973 Program) (No. 2014CB744600), the Program of International S&T Cooperation of MOST (No. 2013DFA11140), the National Natural Science Foundation of China (grant No. 61210010, No. 61300231). Grateful acknowledgement is made to my classmates: Xiang Gao, Jinning Zhao, Xin Guo, Fei Heng and Lele He. They gave us considerable help by means of data collection, comments and criticism.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bin Hu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Liu, Z. et al. (2016). Evaluation of Depression Severity in Speech. In: Ascoli, G., Hawrylycz, M., Ali, H., Khazanchi, D., Shi, Y. (eds) Brain Informatics and Health. BIH 2016. Lecture Notes in Computer Science(), vol 9919. Springer, Cham. https://doi.org/10.1007/978-3-319-47103-7_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47103-7_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47102-0

  • Online ISBN: 978-3-319-47103-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics