Abstract
This work aims at investigating the use of relevance vector machine (RVM) for speech emotion recognition. The RVM technique is a Bayesian extension of the support vector machine (SVM) that is based on a Bayesian formulation of a linear model with an appropriate prior for each weight. Together with the introduction of RVM, aspects related to the use of SVM are also presented. From the comparison between the two classifiers, we find that RVM achieves comparable results to SVM, while using a sparser representation, such that it can be advantageously used for speech emotion recognition.
Keywords
- Support Vector Machine
- Feature Selection
- Emotion Recognition
- Feature Selection Technique
- Relevance Vector Machine
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Vogt, T., André, E.: Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition. In: 2005 IEEE International Conference on Multimedia and Expo., pp. 474–477 (2005)
Batliner, A., Steidl, S., Schuller, B., Seppi, D., et al.: Combing efforts for improving automatic classification of emotional user states. In: Language Technologies, IS-LTC, pp. 240–245 (2006)
Wagner, J., Vogt, T., André, E.: A systematic comparison of different HMM designs for emotion recognition from acted and spontaneous speech. In: Paiva, A.C.R., Prada, R., Picard, R.W. (eds.) ACII 2007. LNCS, vol. 4738, pp. 114–125. Springer, Heidelberg (2007)
Schuller, B., Rigoll, G., Lang, M.: Hidden markov model-based speech emotion recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2 (2003)
Shami, M., Verhelst, W.: An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech. Speech Communication 49, 201–212 (2007)
Tipping, M.E.: Sparse bayesian learning and the relevance vector machine. The Journal of Machine Learning Research 1, 211–244 (2001)
Witten, I.H., Frank, E.: Data mining: practical machine learning tools and techniques with java implementations. Morgan Kaufmann, San Francisco (2000)
Tipping, M.E., Faul, A.C.: Fast marginal likelihood maximisation for sparse bayesian models. In: Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, vol. 1 (2003)
Hastie, T., Tibshirani, R.: Classification by pairwise coupling. Annals of Statistics 26, 451–471 (1998)
Rong, J., Li, G., Chen, Y.P.P.: Acoustic feature selection for automatic emotion recognition from speech. Information Processing and Management 45, 315–328 (2009)
Paeschke, A., Sendlmeier, W.F.: Prosodic characteristics of emotional speech: measurements of fundamental frequency movements. In: SpeechEmotion (2000)
Engberg, I.S., Hansen, A.V.: Documentation of the danish emotional speech database DES. Interal AAU report, Center for Person Kommunikation, Denmark (1996)
Breazeal, C., Aryananda, L.: Recognition of affective communicative intent in robot-directed speech. Autonomous Robots 12, 83–104 (2002)
Slaney, M., McRoberts, G.: BabyEars: a recognition system for affective vocalization. Speech Communication 39, 367–384 (2003)
Hall, M.A.: Correlation-based feature selection for machine learning. Methodology (1999)
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machine. Machine Learning 46, 389–422 (2002)
Chandrakala, S., Sekhar, C.C.: Classification of multi-variate varying length time series using descriptive statistical features. Pattern Recognition and Machine Intelligence, 13–18 (2009)
Hammal, Z., Bozkurt, B., Couvreur, L., Unay, D., Caplier, A., Dutoit, T.: Passive versus active: vocal classification system. In: Proc. Eusipco, Turkey (2005)
Ververidis, D., Kotropoulos, C.: Automatic speech classification to five emotional states based on gender information. In: Proc. Eusipco, Vienna, pp. 341–344 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, F., Verhelst, W., Sahli, H. (2011). Relevance Vector Machine Based Speech Emotion Recognition. In: D’Mello, S., Graesser, A., Schuller, B., Martin, JC. (eds) Affective Computing and Intelligent Interaction. ACII 2011. Lecture Notes in Computer Science, vol 6975. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24571-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-24571-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24570-1
Online ISBN: 978-3-642-24571-8
eBook Packages: Computer ScienceComputer Science (R0)