Abstract
Speech Emotion Recognition (SER) is a hot research topic in the field of Human Computer Interaction. In this paper a SER system is developed with the aim of providing a classification of the “state of interest” of a human subject involved in a job interview. Classification of emotions is performed by analyzing the speech produced during the interview. The presented methods and results show just preliminary conclusions, as the work is part of a larger project including also analysis, investigation and classification of facial expressions and body gestures during human interaction. At the current state of the work, investigation is carried out by using software tools already available for free on the web; furthermore, the features extracted from the audio tracks are analyzed by studying their sensitivity to an audio compression stage. The Berlin Database of Emotional Speech (EmoDB) is exploited to provide the preliminary results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bevilacqua, V., et al.: A face recognition system based on Pseudo 2D HMM applied to neural network coefficients. Soft Computing 12(7), 615–621 (2008)
Bevilacqua, V., et al.: 3D nose feature identification and localization through Self-Organizing Map and Graph Matching. Journal of Circuits Systems and Computers, 191–202 (2010)
Bevilacqua, V., Pannarale, P., Abbrescia, M., Cava, C., Paradiso, A., Tommasi, S.: Comparison of data-merging methods with SVM attribute selection and classification in breast cancer gene expression. BMC Bioinformatics 13(7), S9 (2012), doi:10.1186/1471 2105-13-S7-S9
Bevilacqua, V.: Three-dimensional virtual colonoscopy for automatic polyps detection by artificial neural network approach: New tests on an enlarged cohort of polyps. Neurocomputting, .0925–2312 (2012)
Bishop, C.M.: Pattern Recognition and Machine Learning Information Science and Statistics (2006)
Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. Proceedings of the Institute of Phonetic Sciences 17, 97–110 (1993)
Boersma, P.: Praat: a system for doing phonetics by computer. Glot International 9(10), 341–345 (2001)
Bou-Ghazale, S.E., Hansen, J.H.L.: A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech and Audio Processing 8(4), 429–442 (2000)
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., Weiss, B.: A database of german emotional speech. In: Proceedings on Interspeech 2005, pp. 1517–1520 (2005)
Cortes, C., Vapnik, V.: Support-Vector Networks. Mach. Learn 20(3) (1995)
Dellaert, F., Polzin, T., Waibel, A.: Recognizing Emotion in Speech. Proceedings on Spoken Language 3, 1970–1973 (1996)
Eyben, F., Wollmer, M., Schuller, B.: Introducing the Munich open-source emotion and affect recognition toolkit. In: 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops (2009)
Fant, G.: The Acoustic Theory of Speech Production. Mouton, The Hague (1960)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11, 1 (2009)
Hastie, T., Tibshirani, R.: Classification by Pairwise Coupling. In: Advances in Neural Information Processing Systems (1998)
Golub, T.R.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 46(1-3) (2002)
Jabloun, F.: Teager Energy Based Feature Parameters for Speech Recognition in Car Noise. IEEE Signal Processing Letters 6(10) (1999)
Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer Series in Statistics. Springer, NY (2002)
McGilloway, S., Cowie, R., Douglas-Cowie, E., Gielen, S., Westerdijk, M., Stroeve, S.: Approaching automatic recognition of emotion from voice: a rough benchmark. In: ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion (2000)
Mitchell, T.M.: Machine Learning, 1st edn. McGraw-Hill, New York (1997)
New, T.L., Foo, S.W., De, S.L.C.: Speech emotion recognition using hidden Markov models. Speech Communication 41(4), 603–623 (2003)
Platt, J.C.: Fast Training of Support Vector Machines using Sequential Minimal Optimization. In: Schoelkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press (1998)
Scherer, S., Hofmann, H., Lampmann, M., Pfeil, M., Rhinow, S., Schwenker, F., Palm, G.: Emotion Recognition from Speech: Stress Experiment. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008) European Language Resources Association, ELRA (2008)
Scherer, K.R., Banse, R., Wallbott, H.G., Goldbeck, T.: Vocal cues in Emotion Encoding and Decoding. Motivation and Emotion 15, 123–148 (1996)
Scherer, K.R., Johnstone, T., Klasmeyer, G.: Vocal expression of emotion. Oxford University Press, New York (2003)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press (2008)
Ververidis, D., Kotropoulos, C.: Emotional speech recognition: Resources, features, and methods. Speech Communication 48(9), 1162–1181 (2006)
Ververidis, D., Kotropoulos, C., Pitas, I.: Automatic emotional speech classification. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 1, pp. I-593. IEEE (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bevilacqua, V., Guccione, P., Mascolo, L., Pazienza, P.P., Salatino, A.A., Pantaleo, M. (2013). First Progresses in Evaluation of Resonance in Staff Selection through Speech Emotion Recognition. In: Huang, DS., Jo, KH., Zhou, YQ., Han, K. (eds) Intelligent Computing Theories and Technology. ICIC 2013. Lecture Notes in Computer Science(), vol 7996. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39482-9_76
Download citation
DOI: https://doi.org/10.1007/978-3-642-39482-9_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39481-2
Online ISBN: 978-3-642-39482-9
eBook Packages: Computer ScienceComputer Science (R0)