Abstract
Emotion recognition plays a very important role to make the human-computer interaction more natural. Basically two approaches were used by various researchers i.e. by using facial expression and tone of the voice. In this proposed work speech utterances in Marathi language are used. Seven basic emotions of human beings like angry, happy, disgust, surprise, sad, neutral, and fear have been used in the experimental work. The Marathi emotional words like Gap re (गपरे), Are wa (अरेवा!), Are Deva (अरेदेवा) are used as speech samples for feature extraction. The standard deviation and pitch of voice were determined using PRAAT software. Three speech samples have been used angry and neutral emotion. The Four speech samples have been used for remaining emotions i.e. happy, disgust, surprise, sad and fear. By analysing the feature value of standard deviation 100% recognition accuracy rate obtained for happy, disgust and surprise emotion. 75% accuracy rate for sad & fear and 66.66% accuracy rate for angry and & neutral emotion. The average recognition accuracy rate of seven emotions is 90%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Vaishnav, S., Mitra, S.: Speech emotion recognition: a review. Int. J. Eng. Technol. (IRJET) 03(04) (2016)
Kerkeni, L., Serrestou, Y., Mbarki, M., Roof, K., Mahjoub, M.A.: Speech emotion recognition: methods and cases study. In: Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART 2018), vol. 2, pp. 175–182 (2018)
Santosh, K.C., Borra, S., Joshi, A., Dey, N.: Preface: special section: advances in speech, music and audio signal processing (articles 1–13). Int. J. Speech Technol. 22(2), 293–294 (2019)
Desai, D.: Emotion recognition using speech signal: a review. Int. Res. J. Eng. Technol. (IRJET) 05(04) (2016)
Kwon, O.-W., Chan, K., Hao, J., Lee, T.-W.: Emotion recognition by speech signals. In: EUROSPEECH 2003 - INTERSPEECH 2003 8th European Conference on Speech Communication and Technology Geneva, Switzerland, 1–4 September 2003 (2003)
Lalitha, S., Madhavan, S., Bhushan, B., Saketh, S.: Speech emotion recognition. In: International Conference on Advances in Electronics, Computers and Communications (IJAECC) (2014)
Sato, N., Obuchi, Y.: Emotion recognition using mel-frequency cepstral coefficients. J. Nat. Lang. Process. 2, 835–848 (2007)
Zhang, Q., An, N., Wang, K., Ren, F., Li, L.: Speech emotion recognition using combination of features. In: 2013 Fourth International Conference on Intelligent Control and Information Processing (ICICIP), Beijing, China, 9–11 June 2013 (2013)
Ingale, A.B., Chaudhari, D.S.: Speech emotion recognition. Int. J. Soft Comput. Eng. (IJSCE) 2(1), 235–238 (2012). ISSN 2231-2307
El Ayadi, M., Kamel, M.S., Karray, F.: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn. 44, 572–587 (2011)
Schuller, B., Rigoll, G., Lang, M.: Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine - belief network architecture. 0-7803-8484-©2004 IEEE
Mukherjee, H., Obaidullah, S.M., Santosh, K.C., Phadikar, S., Roy, K.: Line spectral frequency-based features and extreme learning machine for voice activity detection from audio signal. Int. J. Speech Technol. 21(4), 753–760 (2018). https://doi.org/10.1007/s10772-018-9525-6
Gaikwad, S.K., Gawali, B.W., Yannawar, P.: A review on speech recognition technique. Int. J. Comput. Appl. 10(3), 16–24 (2010)
Yannawar, P.L., Manza, G.R., Gawali, B.W., Mehrotra, S.C.: Detection of redundant frame in audio visual speech recognition using low level analysis. In: 2010 IEEE International Conference on Computational Intelligence and Computing Research, Coimbatore, pp. 1–5 (2010). https://doi.org/10.1109/ICCIC.2010.5705746
Gawali, B.W., et al.: Marathi isolated word recognition system using MFCC and DTW features. In: Proceedings of the International Conference on Advances in Computer Science, vol. 1 (2010)
Borde, P., Varpe, A., Manza, R., Yannawar, P.: Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition. Int. J. Speech Technol. 18(2), 167–175 (2014). https://doi.org/10.1007/s10772-014-9257-1
Bordea, P., Varpeb, A., Manzac, R., Yannawara, P.: Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition (2014). arXiv preprint arXiv:1407.1165
Satonkar Suhas, S., Kurhe Ajay, B., Prakash Khanale, B.: Face recognition using principal component analysis and linear discriminant analysis on holistic approach in facial images database. Int. Organ. Sci. Res. 2(12), 15–23 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shinde, A.R., Raut, S.D., Agnihotri, P.P., Khanale, P.B. (2021). Emotion Recognition Using Standard Deviation and Pitch as a Feature in a Marathi Emotional Utterances. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1380. Springer, Singapore. https://doi.org/10.1007/978-981-16-0507-9_45
Download citation
DOI: https://doi.org/10.1007/978-981-16-0507-9_45
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0506-2
Online ISBN: 978-981-16-0507-9
eBook Packages: Computer ScienceComputer Science (R0)