Abstract
Research is ongoing worldwide to judge the emotional state of a speaker from the quality of the human voice alone. This paper explores the use of a supervised neural network to design a classifier that can discriminate among several emotional states in speech, such as happiness, anger, fear, sadness, and an unemotional state. The results are found to be significant, both for cognitive science and for speech technology. In the current paper, statistics of the pitch, the first and second formants, energy, and speaking rate are used as relevant features. Several neural-network-based recognizers are created, and ensembles of such recognizers are used as an important part of a decision support system for prioritizing voice messages and assigning a proper agent to respond to each message. The developed intelligent system can be enhanced to automatically detect and adapt to people’s emotional states, and also to design an emotional robot or computer system.
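The abstract does not give implementation details for the decision support system. As a minimal sketch only, the following shows how an ensemble of emotion recognizers could feed message prioritization by majority vote; the `PRIORITY` ordering, the per-message `votes` field, and the function names are assumptions introduced here for illustration, not the authors' method:

```python
from collections import Counter

# Emotion classes named in the abstract.
EMOTIONS = ["happiness", "anger", "fear", "sadness", "unemotional"]

# Hypothetical urgency ordering (lower value = handled first);
# the paper does not specify how priorities are assigned.
PRIORITY = {"anger": 0, "fear": 1, "sadness": 2, "happiness": 3, "unemotional": 4}

def ensemble_vote(predictions):
    """Majority vote over the labels emitted by the individual recognizers.

    Ties are broken deterministically in favor of the more urgent emotion.
    """
    counts = Counter(predictions)
    return min(counts, key=lambda emotion: (-counts[emotion], PRIORITY[emotion]))

def prioritize(messages):
    """Order voice messages by the urgency of their voted emotion label.

    Each message is a dict with a 'votes' list holding one predicted
    label per recognizer in the ensemble.
    """
    return sorted(messages, key=lambda m: PRIORITY[ensemble_vote(m["votes"])])

# Example: an angry/fearful caller is queued ahead of a happy one.
msgs = [
    {"id": 1, "votes": ["happiness", "happiness", "unemotional"]},
    {"id": 2, "votes": ["anger", "fear", "anger"]},
]
ordered = prioritize(msgs)  # message 2 comes first
```

In this sketch each recognizer in the ensemble contributes one label per message; a real system would weight votes by recognizer confidence, which the abstract leaves unspecified.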
© 2004 Springer-Verlag Berlin Heidelberg
Giripunje, S., Panat, A. (2004). Speech Recognition for Emotions with Neural Network: A Design Approach. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science, vol 3214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30133-2_84
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23206-3
Online ISBN: 978-3-540-30133-2