Abstract
The goal of this work is the exploration of real-time emotion recognition from speech. In this approach a novel type of recurrent neural networks called echo state networks (ESN) are utilized. Biologically motivated features representing modulations of the speech signal are used as input to the ESNs. The standard Berlin Database of Emotional Speech is used to evaluate the performance of the proposed approach. However, in this paper ongoing work is being presented and the final architecture has yet to be determined.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., Weiss, B.: A database of german emotional speech. In: Proceedings of Interspeech 2005 (2005)
Calder, A.J., Keane, J., Manes, F., Antoun, N., Young, A.W.: Impaired recognition and experience of disgust following brain injury. Nature Neuroscience 3, 1077–1078 (2000)
Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.: Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine 18(1), 32–80 (2001)
Jaeger, H.: Tutorial on training recurrent neural networks, covering bppt, rtrl, ekf and the echo state network approach. Technical Report 159, Fraunhofer-Gesellschaft, St. Augustin Germany (2002)
Kuncheva, L.: Combining pattern classifiers: methods and algorithms. Wiley, Chichester (2004)
Petrushin, V.: Emotion in speech: recognition and application to call centers. In: Proceedings of Artificial Neural Networks in Engineering (1999)
Picard, R.W.: Affective Computing. MIT Press, Cambridge (2000)
Scherer, K.R., Johnstone, T., Klasmeyer, G.: Handbook of Affective Sciences - Vocal expression of emotion, ch.23. In: Affective Science, pp. 433–456. Oxford University Press, Oxford (2003)
Scherer, S., Oubbati, M., Schwenker, F., Palm, G.: Real-time emotion recognition from speech using echo state networks. In: Proceedings of ANNPR (submission, 2008)
Scherer, S., Schwenker, F., Palm, G.: Classifier fusion for emotion recognition from speech. In: Proceedings of Intelligent Environments 2007 (2007)
Yacoub, S., Simske, S., Lin, X., Burns, J.: Recognition of emotions in interactive voice response systems. In: Proceedings of Eurospeech 2003 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Scherer, S., Oubbati, M., Schwenker, F., Palm, G. (2008). Real-Time Emotion Recognition Using Echo State Networks. In: André, E., Dybkjær, L., Minker, W., Neumann, H., Pieraccini, R., Weber, M. (eds) Perception in Multimodal Dialogue Systems. PIT 2008. Lecture Notes in Computer Science(), vol 5078. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69369-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-69369-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69368-0
Online ISBN: 978-3-540-69369-7
eBook Packages: Computer ScienceComputer Science (R0)