Abstract
This paper proposes the application of nonlinear measures based on nonlinear dynamics for emotional speech characterization. Measures such as mutual information, dimension correlation, entropy correlation, Shannon entropy, Lempel-Ziv complexity and Hurst exponent are extracted from the samples of a database of emotional speech. Then, statistics such as mean, standard deviation, skewness and kurtosis are applied on the extracted measures. Experiments were conducted on the Berlin emotional speech database for a threeclass problem (neutral, fear and anger emotional states). Feature selection is accomplished to select a reduced number of features. In order to evaluate the discrimination ability of the selected features a neural network classifier is used. A global success rate of 93.78% is obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yildirim, S., Narayanan, S., Potamianos, A.: Detecting emotional state of a child in a conversational computer game. Computer Speech and Language 25, 29–44 (2011)
Burkhardt, F., Polzehl, T., Stegmann, J., Metze, F., Huber, R.: Detecting real life anger. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp. 4761–4764. IEEE Press, New York (2009)
Lefter, I., Rothkrantz, L.J.M., van Leeuwen, D.A., Wiggers, P.: Automatic Stress Detection in Emergency (Telephone) Calls. Int. J. of Intelligent Defence Support Systems 4(2), 148–168 (2011)
Polzehl, T., Schmitt, A., Metze, F., Wagner, M. Anger recognition in speech using acoustic and linguistic cues. Speech Comm. (2011), doi:10.1016/j.specom.2011.05.002
Wu, S., Falk, T.H., Wai-Yip, C.: Automatic recognition of speech emotion using long-term spectro-temporal features. In: 16th Int. Conf. on Digital Signal Proc., July 5-7, pp. 1–6 (2009)
Giannakopoulos, T., Pikrakis A., Theodoridis, S.A.: Dimensional Approach to Emotion Recognition of Speech from Movies. In: IEEE Int. Conf. on Acoustic, Speech and Signal Proc., pp. 65–68 (2009)
Schuller, B., Batliner, A., Steidl, S., Seppi, D.: Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge. Speech Comm. (2011), doi:10.1016/j.specom.2011.01.011.
A Database of German Emocional Speech, http://pascal.kgw.tu-berlin.de/emodb/
Wu, S., Falk, T.H., Wai-Yip, C.: Automatic speech emotion recognition using modulation spectral features. Speech Comm. 53, 768–785 (2011)
Henríquez, P., Alonso, J.B., Ferrer, M.A., Travieso, C.M., Godino-Llorente, J.I., Díaz-de-María, F.: Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics. IEEE Trans. on Audio, Speech and Language Proc. 17(6) (2009)
Alonso, J.B., Díaz-de-María, F., Travieso, C.M., Ferrer, M.A.: Using nonlinear features for voice disorder detection. In: Proc. 3rd Int. Conf. Nonlinear Speech Process., Barcelona, Spain, pp. 94–106 (2005)
Vaziri, G., Almasganj, F., Jenabi, M.S.: On the Fractal Self- Similarity of Laryngeal Pathologies Detection: The estimation of Hurst parameter. In: Proc. of the 5th Int. Conf. on Inf. Technology and Application in Biomedicine, Shenzhen, China (2008)
Vaziri, G., Almasganj, F., Behroozmand, R.: Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Computers in Biology and Medicine 40, 54–63 (2010)
Takens, F.: Detecting strange attractors in turbulence. Lecture Notes in Math., vol. 898, pp. 366–381. Springer, New York (1981)
Fraser, A.M., Swinney, H.L.: Independent coordinates for strange attractors from mutual information. Phys. Rev. A 33, 1134 (1986)
Kennel, M.B., Brown, R., Abarbanel, H.D.I.: Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 45, 3403–3411 (1992)
Kantz, H., Schreiber, T.: Nonlinear Time Series Analysis. Univ. Press, Cambridge (1997)
Theiler, J.: Lacunarity in a best estimator of fractal dimension. Phys. Lett. A 133, 195–200 (1988)
Kaspar, F., Shuster, H.G.: Easily calculable measure for complexity of spatiotemporal patterns. Phys. Rev. A 36, 842–848 (1987)
Lempel, A., Ziv, J.: On the complexity of finite sequences. IEEE Trans. Inform. Theory 22, 75–81 (1976)
Hurst, H.E., Black, R.P., Simaika, Y.M.: Long-term storage: an experimental study, London (1965)
Pudil, P., Novovicová, J., Kittler, J.: Floating search methods in feature selection. Pattern Recognition Letters 15, 1119–1125 (1994)
Ruelle, D.: Deterministic chaos: the science and the fiction. Proc. R. Soc. Lond. A 427, 241–248 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Henríquez, P., Alonso, J.B., Ferrer, M.A., Travieso, C.M., Orozco-Arroyave, J.R. (2011). Application of Nonlinear Dynamics Characterization to Emotional Speech. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science(), vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-25020-0_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25019-4
Online ISBN: 978-3-642-25020-0
eBook Packages: Computer ScienceComputer Science (R0)