Skip to main content

Application of Nonlinear Dynamics Characterization to Emotional Speech

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2011)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7015))

Included in the following conference series:

Abstract

This paper proposes the application of nonlinear measures based on nonlinear dynamics for emotional speech characterization. Measures such as mutual information, dimension correlation, entropy correlation, Shannon entropy, Lempel-Ziv complexity and Hurst exponent are extracted from the samples of a database of emotional speech. Then, statistics such as mean, standard deviation, skewness and kurtosis are applied on the extracted measures. Experiments were conducted on the Berlin emotional speech database for a threeclass problem (neutral, fear and anger emotional states). Feature selection is accomplished to select a reduced number of features. In order to evaluate the discrimination ability of the selected features a neural network classifier is used. A global success rate of 93.78% is obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Yildirim, S., Narayanan, S., Potamianos, A.: Detecting emotional state of a child in a conversational computer game. Computer Speech and Language 25, 29–44 (2011)

    Article  Google Scholar 

  2. Burkhardt, F., Polzehl, T., Stegmann, J., Metze, F., Huber, R.: Detecting real life anger. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp. 4761–4764. IEEE Press, New York (2009)

    Google Scholar 

  3. Lefter, I., Rothkrantz, L.J.M., van Leeuwen, D.A., Wiggers, P.: Automatic Stress Detection in Emergency (Telephone) Calls. Int. J. of Intelligent Defence Support Systems 4(2), 148–168 (2011)

    Article  Google Scholar 

  4. Polzehl, T., Schmitt, A., Metze, F., Wagner, M. Anger recognition in speech using acoustic and linguistic cues. Speech Comm. (2011), doi:10.1016/j.specom.2011.05.002

    Google Scholar 

  5. Wu, S., Falk, T.H., Wai-Yip, C.: Automatic recognition of speech emotion using long-term spectro-temporal features. In: 16th Int. Conf. on Digital Signal Proc., July 5-7, pp. 1–6 (2009)

    Google Scholar 

  6. Giannakopoulos, T., Pikrakis A., Theodoridis, S.A.: Dimensional Approach to Emotion Recognition of Speech from Movies. In: IEEE Int. Conf. on Acoustic, Speech and Signal Proc., pp. 65–68 (2009)

    Google Scholar 

  7. Schuller, B., Batliner, A., Steidl, S., Seppi, D.: Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge. Speech Comm. (2011), doi:10.1016/j.specom.2011.01.011.

    Google Scholar 

  8. A Database of German Emocional Speech, http://pascal.kgw.tu-berlin.de/emodb/

  9. Wu, S., Falk, T.H., Wai-Yip, C.: Automatic speech emotion recognition using modulation spectral features. Speech Comm. 53, 768–785 (2011)

    Article  Google Scholar 

  10. Henríquez, P., Alonso, J.B., Ferrer, M.A., Travieso, C.M., Godino-Llorente, J.I., Díaz-de-María, F.: Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics. IEEE Trans. on Audio, Speech and Language Proc. 17(6) (2009)

    Google Scholar 

  11. Alonso, J.B., Díaz-de-María, F., Travieso, C.M., Ferrer, M.A.: Using nonlinear features for voice disorder detection. In: Proc. 3rd Int. Conf. Nonlinear Speech Process., Barcelona, Spain, pp. 94–106 (2005)

    Google Scholar 

  12. Vaziri, G., Almasganj, F., Jenabi, M.S.: On the Fractal Self- Similarity of Laryngeal Pathologies Detection: The estimation of Hurst parameter. In: Proc. of the 5th Int. Conf. on Inf. Technology and Application in Biomedicine, Shenzhen, China (2008)

    Google Scholar 

  13. Vaziri, G., Almasganj, F., Behroozmand, R.: Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Computers in Biology and Medicine 40, 54–63 (2010)

    Article  Google Scholar 

  14. Takens, F.: Detecting strange attractors in turbulence. Lecture Notes in Math., vol. 898, pp. 366–381. Springer, New York (1981)

    MATH  Google Scholar 

  15. Fraser, A.M., Swinney, H.L.: Independent coordinates for strange attractors from mutual information. Phys. Rev. A 33, 1134 (1986)

    Article  MathSciNet  MATH  Google Scholar 

  16. Kennel, M.B., Brown, R., Abarbanel, H.D.I.: Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 45, 3403–3411 (1992)

    Article  Google Scholar 

  17. Kantz, H., Schreiber, T.: Nonlinear Time Series Analysis. Univ. Press, Cambridge (1997)

    MATH  Google Scholar 

  18. Theiler, J.: Lacunarity in a best estimator of fractal dimension. Phys. Lett. A 133, 195–200 (1988)

    Article  MathSciNet  Google Scholar 

  19. Kaspar, F., Shuster, H.G.: Easily calculable measure for complexity of spatiotemporal patterns. Phys. Rev. A 36, 842–848 (1987)

    Article  MathSciNet  Google Scholar 

  20. Lempel, A., Ziv, J.: On the complexity of finite sequences. IEEE Trans. Inform. Theory 22, 75–81 (1976)

    Article  MathSciNet  MATH  Google Scholar 

  21. Hurst, H.E., Black, R.P., Simaika, Y.M.: Long-term storage: an experimental study, London (1965)

    Google Scholar 

  22. Pudil, P., Novovicová, J., Kittler, J.: Floating search methods in feature selection. Pattern Recognition Letters 15, 1119–1125 (1994)

    Article  Google Scholar 

  23. Ruelle, D.: Deterministic chaos: the science and the fiction. Proc. R. Soc. Lond. A 427, 241–248 (1990)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Henríquez, P., Alonso, J.B., Ferrer, M.A., Travieso, C.M., Orozco-Arroyave, J.R. (2011). Application of Nonlinear Dynamics Characterization to Emotional Speech. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science(), vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25020-0_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25019-4

  • Online ISBN: 978-3-642-25020-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics