Representative and Discriminant Feature Extraction Based on NMF for Emotion Recognition in Speech

Kim, Dami; Lee, Soo-Young; Amari, Shun-ichi

doi:10.1007/978-3-642-10677-4_74


Conference paper


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5863)

Abstract

For emotion recognition in speech, we have developed two feature extraction algorithms that emphasize subtle emotional differences while de-emphasizing the dominant linguistic components. The starting point is to extract 200 statistical features from the intensity and pitch time series, which are regarded as a superset of the necessary emotional features. The first algorithm, rNMF (representative Non-negative Matrix Factorization), selects the simple features that best represent the complex NMF-based features: it first extracts a large set of complex, almost mutually independent features by unsupervised learning and later selects a small number of simple features for the classification tasks. The second algorithm, dNMF (discriminant NMF), extracts only the discriminant features by adding the Fisher criterion as an additional constraint on the cost function of the standard NMF algorithm. Both algorithms demonstrate much better recognition rates, even with only 20 features, on the popular Berlin database.
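
The two building blocks named in the abstract, standard NMF and the Fisher criterion, can be illustrated with a minimal sketch. This is not the authors' rNMF or dNMF: the abstract does not give the feature selection rule or the exact form of the constrained cost, so the code below only shows the well-known Lee-Seung multiplicative updates for the squared-error cost and a common trace-ratio form of the Fisher criterion evaluated on the NMF encodings. The function names `nmf` and `fisher_ratio`, the random synthetic data, and the choice of 4 classes are assumptions made for illustration.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factor a non-negative matrix V (features x samples) as W @ H.

    Uses the Lee-Seung multiplicative updates for the squared Euclidean
    cost ||V - W H||^2. Illustrative only; not the paper's rNMF or dNMF.
    """
    rng = np.random.default_rng(seed)
    n_features, n_samples = V.shape
    W = rng.random((n_features, rank)) + eps
    H = rng.random((rank, n_samples)) + eps
    for _ in range(n_iter):
        # Element-wise updates keep W and H non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def fisher_ratio(H, labels):
    """Between-class scatter divided by within-class scatter (trace form).

    H holds one encoding per column; labels holds one class label per column.
    Larger values mean the classes are better separated in the encoding.
    """
    labels = np.asarray(labels)
    mean_all = H.mean(axis=1, keepdims=True)
    s_within = 0.0
    s_between = 0.0
    for c in np.unique(labels):
        Hc = H[:, labels == c]
        mean_c = Hc.mean(axis=1, keepdims=True)
        s_within += np.sum((Hc - mean_c) ** 2)
        s_between += Hc.shape[1] * np.sum((mean_c - mean_all) ** 2)
    return s_between / (s_within + 1e-12)


# Synthetic stand-in for a 200-feature data set (assumption: random data,
# 60 utterances, 4 hypothetical emotion classes, 20 NMF components).
rng = np.random.default_rng(1)
V = rng.random((200, 60))
labels = np.repeat(np.arange(4), 15)
W, H = nmf(V, rank=20)
print("reconstruction error:", np.linalg.norm(V - W @ H))
print("Fisher ratio of encodings:", fisher_ratio(H, labels))
```

A discriminant variant in the spirit of the abstract would add a term based on such a Fisher ratio to the reconstruction cost, so that the learned encodings are pushed toward class separation rather than reconstruction alone; how that term is weighted and optimized is left open here because the abstract does not specify it.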



Author information

Authors and Affiliations

Brain Science Research Center and Department of Bio and Brain Engineering, KAIST
Dami Kim & Soo-Young Lee
Department of Electrical Engineering, KAIST, 373-1 Guseong-dong, Yuseong-gu, Daejeon, 305-701, Korea (South)
Soo-Young Lee
Mathematical Neuroscience laboratory, Brain Science Institute, RIKEN, 2-2 Hirosawa, Wako-shi, Saitama, 351-0198, Japan
Dami Kim, Soo-Young Lee & Shun-ichi Amari


Editor information

Editors and Affiliations

Department of Electronic Engineering, City University of Hong Kong, Hong Kong
Chi Sing Leung
School of Electrical Engineering and Computer Science, Kyungpook National University, 1370 Sankyuk-Dong, Puk-Gu, 702-701, Taegu, Korea
Minho Lee
School of Information Technology, King Mongkut’s University of Technology Thonburi, 126 Pracha-U-Thit Rd., Bangmod, Thungkru, 10140, Bangkok, Thailand
Jonathan H. Chan

About this paper

Cite this paper

Kim, D., Lee, SY., Amari, Si. (2009). Representative and Discriminant Feature Extraction Based on NMF for Emotion Recognition in Speech. In: Leung, C.S., Lee, M., Chan, J.H. (eds) Neural Information Processing. ICONIP 2009. Lecture Notes in Computer Science, vol 5863. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10677-4_74

DOI: https://doi.org/10.1007/978-3-642-10677-4_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10676-7
Online ISBN: 978-3-642-10677-4
eBook Packages: Computer Science, Computer Science (R0)
