Abstract
Depression is a frequent affective disorder, leading to a high impact on patients, their families and society. Depression diagnosis is limited by assessment methods that rely on patient-reported or clinician judgments of symptom severity. Recently, many researches showed that voice is an objective indicator for depressive diagnosis. In this paper, we investigate a sample of 111 subjects (38 healthy controls, 36 mild depressed patients and 37 severe depressed patients) through comparative analysis to explore the correlation between acoustic features and depression severity. We extract features as many as possible according to previous researches to create a large voice feature set. Then we employ some feature selection methods to form compact subsets on different tasks. Finally, we evaluate depressive disorder severity by these acoustic feature subsets. Results show that interview is a better choice than reading and picture description for depression assessment. Meanwhile, speech signal correlate to depression severity in a medium-level with statistically significant (p < 0.01).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lecrubier, Y.: Depressive illness and disability. Eur. Neuropsychopharmacol. 10, S439–S443 (2000)
World Health Organization. http://www.who.int/mediacentre/factsheets/fs396/en/
Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)
Zung, W.W., Richards, C.B., Short, M.J.: Self-rating depression scale in an outpatient clinic: further validation of the SDS. Arch. Gen. Psychiatry 13, 508–515 (1965)
American Psychiatric Association: DSM-III-R: Diagnostic and Statistical Manual of Mental Disorders. American Psychiatric Association, Arlington (1980)
Horwitz, R., Quatieri, T.F., Helfer, B.S., Yu, B., Williamson, J.R., Mundt, J.: On the relative importance of vocal source, system, and prosody in human depression. In: 2013 IEEE International Conference on Body Sensor Networks (BSN), pp. 1–6. IEEE (2013)
Moore, E., Clements, M., Peifer, J., Weisser, L.: Analysis of prosodic variation in speech for clinical depression. In: Proceedings of the 25th Annual International Conference of the IEEE on Engineering in Medicine and Biology Society, pp. 2925–2928. IEEE (2003)
France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, D.M.: Acoustical properties of speech as indicators of depression and suicidal risk. IEEE Trans. Biomed. Eng. 47, 829–837 (2000)
Quatieri, T.F., Malyska, N.: Vocal-source biomarkers for depression: a link to psychomotor activity. In: Interspeech, pp. 1059–1062
Vicsi, K., Sztaho, D., Kiss, G.: Examination of the sensitivity of acoustic-phonetic parameters of speech to depression. In: 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom), pp. 511–515. IEEE (2012)
Harel, B., Cannizzaro, M., Snyder, P.J.: Variability in fundamental frequency during speech in prodromal and incipient Parkinson’s disease: a longitudinal case study. Brain Cogn. 56, 24–29 (2004)
Mundt, J.C., Vogel, A.P., Feltner, D.E., Lenderking, W.R.: Vocal acoustic biomarkers of depression severity and treatment response. Biol. Psychiatry 72, 580–587 (2012)
Nilsonne, Å., Sundberg, J., Ternström, S., Askenfelt, A.: Measuring the rate of change of voice fundamental frequency in fluent speech during mental depression. J. Acoust. Soc. Am. 83, 716–728 (1988)
Scripture, E.: A study of emotions by speech transcription. Vox 31, 179–183 (1921)
Ooi, K.E.B., Low, L.-S.A., Lech, M., Allen, N.: Early prediction of major depression in adolescents using glottal wave characteristics and teager energy parameters. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4613–4616. IEEE (2012)
Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., Parker, G.: Detecting depression: a comparison between spontaneous and read speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 7547–7551. IEEE (2013)
Kroencke, K., Spitzer, R., Williams, J.: The phq-9: validity of a brief depression severity measure [electronic version]. J. Gen. Intern. Med. 16, 606–613 (2001)
Hönig, F., Batliner, A., Nöth, E., Schnieder, S., Krajewski, J.: Automatic modelling of depressed speech: relevant features and relevance of gender. In: INTERSPEECH, pp. 1248–1252
ShuJuTang. http://www.datatang.com/data/43216
Bandura, A., Pastorelli, C., Barbaranelli, C., Caprara, G.V.: Self-efficacy pathways to childhood depression. J. Pers. Soc. Psychol. 76, 258 (1999)
Zhou, G., Hansen, J.H., Kaiser, J.F.: Nonlinear feature based classification of speech under stress. IEEE Trans. Speech Audio Process. 9, 201–216 (2001)
Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., Parker, G.: A comparative study of different classifiers for detecting depression from spontaneous speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8022–8026. IEEE (2013)
Kurniawan, H., Maslov, A.V., Pechenizkiy, M.: Stress detection from speech and galvanic skin response signals. In: 2013 IEEE 26th International Symposium on Computer-Based Medical Systems (CBMS), pp. 209–214. IEEE (2013)
De Jong, N.H., Wempe, T.: Praat script speech rate (2008). Accessed 14 Oct 2008
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1226–1238 (2005)
Pudil, P., Novovičová, J., Kittler, J.: Floating search methods in feature selection. Pattern Recogn. Lett. 15, 1119–1125 (1994)
Mitchell, T.M.: Machine Learning. WCB/McGraw-Hill, Boston (1997)
Quinlan, J.R.: C4. 5: programs for machine learning. Elsevier (2014)
Hsu, C.-W., Lin, C.-J.: A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13, 415–425 (2002)
Farzanfar, R., Hereen, T., Fava, J., Davis, J., Vachon, L., Friedman, R.: Psychometric properties of an automated telephone-based PHQ-9. Telemed. e-Health 20, 115–121 (2014)
Spitzer, R.L., Kroenke, K., Williams, J.B., Patient Health Questionnaire Primary Care Study Group: Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. JAMA 282, 1737–1744 (1999)
Löwe, B., Unützer, J., Callahan, C.M., Perkins, A.J., Kroenke, K.: Monitoring depression treatment outcomes with the patient health questionnaire-9. Med. Care 42, 1194–1201 (2004)
Acknowledgment
This work was supported by the National Basic Research Program of China (973 Program) (No. 2014CB744600), the Program of International S&T Cooperation of MOST (No. 2013DFA11140), the National Natural Science Foundation of China (grant No. 61210010, No. 61300231). Grateful acknowledgement is made to my classmates: Xiang Gao, Jinning Zhao, Xin Guo, Fei Heng and Lele He. They gave us considerable help by means of data collection, comments and criticism.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Liu, Z. et al. (2016). Evaluation of Depression Severity in Speech. In: Ascoli, G., Hawrylycz, M., Ali, H., Khazanchi, D., Shi, Y. (eds) Brain Informatics and Health. BIH 2016. Lecture Notes in Computer Science(), vol 9919. Springer, Cham. https://doi.org/10.1007/978-3-319-47103-7_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-47103-7_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47102-0
Online ISBN: 978-3-319-47103-7
eBook Packages: Computer ScienceComputer Science (R0)