Statistical Feature Selection for Mandarin Speech Emotion Recognition

Xie, Bo; Chen, Ling; Chen, Gen-Cai; Chen, Chun

doi:10.1007/11538059_62

Bo Xie¹⁹,
Ling Chen¹⁹,
Gen-Cai Chen¹⁹ &
…
Chun Chen¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3644))

Included in the following conference series:

International Conference on Intelligent Computing

4303 Accesses
3 Citations

Abstract

Performance of speech emotion recognition largely depends on the acoustic features used in a classifier. This paper studies the statistical feature selection problem in Mandarin speech emotion recognition. This study was based on a speaker dependent emotional mandarin database. Pitch, energy, duration, formant related features and some velocity information were selected as base features. Some statistics of them consisted of original feature set and full stepwise discriminant analysis (SDA) was employed to select extracted features. The results of feature selection were evaluated through a LDA based classifier. Experiment results indicate that pitch, log energy, speed and 1st formant are the most important factors and the accuracy rate increases from 63.1 % to 76.5 % after feature selection. Meanwhile, the features selected by SDA are better than the results of other feature selection methods in a LDA based classifier and SVM. The best performance is achieved when the feature number is in the range of 9 to 12.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Picard, R.W.: Affective Computing. MIT Press, Cambridge (1997)
Google Scholar
Murray, I.R., Arnott, J.L.: Toward the Simulation of Emotion in Synthetic Speech: A Review of the Literature on Human Vocal Emotion. Journal of the Acoustical Society of America 93(2), 1097–1108 (1933)
Article Google Scholar
Dellaert, F., Polzin, T., Waibel, A.: Recognizing Emotion in Speech. In: Proceedings of International Conference on Spoken Language Processing, pp. 1970–1973 (1996)
Google Scholar
Lee, C.M., Narayanan, S., Pieraccini, R.: Recognition of Negative Emotions from the Speech Signal. In: Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 240–243 (2001)
Google Scholar
Kwon, O.W., Chan, K., Hao, J., Lee, T.W.: Emotion Recognition by Speech Signals. In: Proceedings of EUROSPEECH, pp. 125–128 (2003)
Google Scholar
Wang, Z.P., Zhao, L., Zou, C.R.: Emotion Recognition of Speech using Fuzzy Entropy Effectiveness Analysis. Journal of circuits and systems 8(3), 109–112 (2003)
Google Scholar
James, M.: Classification Algorithms. John Wiley & Sons, London (1985)
MATH Google Scholar
Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., et al.: Emotion Recognition in Human-computer Interaction. IEEE Signal Processing Magazine 18(1), 32–80 (2001)
Article Google Scholar
Cai, L.L., Jiang, C.H., Wang, Z.P.: A Method Combining the Global and Time Series Structure Features for Emotion Recognition in Speech. In: Proceedings of International Conference on Neural Networks and Signal Processing, pp. 904–907 (2003)
Google Scholar
Boersma, P., Weenink, D.: Praat Speech Processing Software. Institute of Phonetics Sciences of the University of Amsterdam, http://www.praat.org

Download references

Author information

Authors and Affiliations

College of Computer Science, Zhejiang University, Hangzhou, 310027, P.R. China
Bo Xie, Ling Chen, Gen-Cai Chen & Chun Chen

Authors

Bo Xie
View author publications
You can also search for this author in PubMed Google Scholar
Ling Chen
View author publications
You can also search for this author in PubMed Google Scholar
Gen-Cai Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Intelligent Computing Lab, Institute of Intelligent Machines, Chinese Academy of Sciences,, China
De-Shuang Huang
School of Computer & Information Technology, Beijing Jiaotong University, 100044, Beijing, P.R. China
Xiao-Ping Zhang
School of Electrical and Electronic Engineering, Nanyang Technological University, P.O. Box, Singapore
Guang-Bin Huang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, B., Chen, L., Chen, GC., Chen, C. (2005). Statistical Feature Selection for Mandarin Speech Emotion Recognition. In: Huang, DS., Zhang, XP., Huang, GB. (eds) Advances in Intelligent Computing. ICIC 2005. Lecture Notes in Computer Science, vol 3644. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11538059_62

Download citation

DOI: https://doi.org/10.1007/11538059_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28226-6
Online ISBN: 978-3-540-31902-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics