Skip to main content

Screening Trauma Through CNN-Based Voice Emotion Classification

  • Conference paper
  • First Online:
Intelligent Human Computer Interaction (IHCI 2020)

Abstract

Recently, modern people experience trauma symptom for various reasons. Trauma causes emotional control problems and anxiety. Although a psychiatric diagnosis is essential, people are reluctant to visit hospitals. In this paper, we propose a method for screening trauma based on voice audio data using convolutional neural networks. Among the six basic emotions, four emotions were used for screening trauma: fear, sad, happy, and neutral. The first pre-processing of adjusting the length of the audio data in units of 2 s and augmenting the number of data, and the second pre-processing is performed in order to convert voice temporal signal into a spectrogram image by short-time Fourier transform. The spectrogram images are trained through the four convolution neural networks. As a result, VGG-13 model showed the highest performance (98.96%) for screening trauma among others. A decision-level fusion strategy as a post-processing is adopted to determine the final traumatic state by confirming the maintenance of the same continuous state for the traumatic state estimated by the trained VGG-13 model. As a result, it was confirmed that high-accuracy voice-based trauma diagnosis is possible according to the setting value for continuous state observation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lee, B.S.: Nature and ethics inherent in the trauma of division. J. Epoch Philos. 22(1), 153–183 (2011)

    Google Scholar 

  2. Ahn, H.N.: Recent trend of trauma treatment. J. Korean Psychol. Assoc. 2014(1), 162 (2014)

    Google Scholar 

  3. So, S.W., et al.: Development of age classification deep learning algorithm using Korean speech. J. Biomed. Eng. Res. 39(2), 63–68 (2018)

    Google Scholar 

  4. Choee, H.W., Park, S.M., Sim, K.B.: CNN-based speech emotion recognition using transfer learning. Int. J. Korean Inst. Intell. Syst. 29(5), 339–344 (2019)

    Google Scholar 

  5. Ekman, P.: An argument for basic emotions. Cogn. Emot. 6(3–4), 169–200 (1992)

    Article  Google Scholar 

  6. Amstadter, A.B., Vernon, L.L.: Emotional reactions during and after trauma: a comparison of trauma types. J. Aggress. Maltreatment Trauma 16(4), 391–408 (2008)

    Article  Google Scholar 

  7. Center for Substance Abuse Treatment: In Trauma-Informed Care in Behavioral Health Services. Substance Abuse and Mental Health Services Administration, US (2014)

    Google Scholar 

  8. Kakao enterprise. https://tech.kakaoenterprise.com/66. Accessed 26 Oct 2020

  9. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  10. Park, D.S., Bang, J.I., Kim, H.J., Ko, Y.J.: A study on the gender and age classification of speech data using CNN. J. Korean Inst. Inf. Technol. 16(11), 11–21 (2018)

    Article  Google Scholar 

Download references

Acknowledgement

This work was supported by the Industrial Strategic Technology Development Program (No. 10073159) funded by the Ministry of Trade, Industry & Energy (MI, Korea).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eui Chul Lee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kim, N.H., Kim, S.E., Mok, J.W., Yu, S.G., Han, N.Y., Lee, E.C. (2021). Screening Trauma Through CNN-Based Voice Emotion Classification. In: Singh, M., Kang, DK., Lee, JH., Tiwary, U.S., Singh, D., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2020. Lecture Notes in Computer Science(), vol 12615. Springer, Cham. https://doi.org/10.1007/978-3-030-68449-5_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-68449-5_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-68448-8

  • Online ISBN: 978-3-030-68449-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics