Abstract
As technological advancements progress, dependence on machines is inevitable. Therefore, to facilitate effective interaction between humans and machines, it has become crucial to develop proficient techniques for Speech Emotion Recognition (SER). This paper uses phase-based features, namely Modified Group Delay Cepstral Coefficients for SER. To the best of our knowledge, this paper is the first attempt to use the MGDCC feature on emotions. Experiments were performed using the EmoDB database on emotions, anger, happy, neutral, and sad. The proposed feature outperformed the baseline Mel Frequency Cepstral Coefficients (MFCC) and Linear Frequency Cepstral Coefficients (LFCC) by 7.7 % and 5.14 %, respectively. The noise robustness characteristics of MGDCC were tested on stationary and non-stationary noise and the results were promising. The latency period was also analysed and MGDCC proved to be the most practically suitable feature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Tarnowski, P., KoÅodziej, M., Majkowski, A., Rak, R.J.: Emotion recognition using facial expressions. Procedia Comput. Sci. 108, 1175ā1184 (2017)
Basu, S., Chakraborty, J., Bag, A., Aftabuddin, M.: A review on emotion recognition using speech. In: 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT), pp. 109ā114. IEEE (2017)
Abramson, L., Petranker, R., Marom, I., Aviezer, H.: Social interaction context shapes emotion recognition through body language, not facial expressions. Emotion 21(3), 557 (2021)
Khalil, R.A., Jones, E., Babar, M.I., Jan, T., Zafar, M.H., Alhussain, T.: Speech emotion recognition using deep learning techniques: a review. IEEE Access 7, 117327ā117345 (2019)
Cabanac, M.: What is emotion? Behav. Proc. 60(2), 69ā83 (2002)
Sethu, V., Ambikairajah, E., Epps, J.: Group delay features for emotion detection. In: Eighth Annual Conference of the International Speech Communication Association (2007)
Swain, M., Routray, A., Kabisatpathy, P.: Databases, features and classifiers for speech emotion recognition: a review. Int. J. Speech Technol. 21(1), 93ā120 (2018). https://doi.org/10.1007/s10772-018-9491-z
Schluter, R., Ney, H.: Using phase spectrum information for improved speech recognition performance. In: 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No. 01CH37221), vol. 1, pp. 133ā136. IEEE (2001)
Murthy, H.A.: Algorithms for processing Fourier transform phase of signals, Ph.D. dissertation, Indian Institute of Technology, Department of Computer ... (1992)
Murthy, H.A., Gadde, V.: The modified group delay function and its application to phoneme recognition. In: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP 2003), vol. 1, pp. I-68. IEEE (2003)
Hegde, R.M., Murthy, H.A., Gadde, V.R.R.: Significance of the modified group delay feature in speech recognition. IEEE Trans. Audio Speech Lang. Process. 15(1), 190ā202 (2006)
Parthasarathi, S.H.K., Rajan, P., Murthy, H.A.: Robustness of group delay representations for noisy speech signals. Technical report, Idiap (2011)
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W.F., Weiss, B.: A database of German emotional speech. In: Interspeech, vol. 5, pp. 1517ā1520 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Uthiraa, S., Pusuluri, A., Patil, H.A. (2023). Modified Group Delay Features forĀ Emotion Recognition. In: Maji, P., Huang, T., Pal, N.R., Chaudhury, S., De, R.K. (eds) Pattern Recognition and Machine Intelligence. PReMI 2023. Lecture Notes in Computer Science, vol 14301. Springer, Cham. https://doi.org/10.1007/978-3-031-45170-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-031-45170-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45169-0
Online ISBN: 978-3-031-45170-6
eBook Packages: Computer ScienceComputer Science (R0)