Emotion Recognition Using Standard Deviation and Pitch as a Feature in a Marathi Emotional Utterances

Shinde, Ashok R.; Raut, Shriram D.; Agnihotri, Prashant P.; Khanale, Prakash B.

doi:10.1007/978-981-16-0507-9_45

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1380)

Included in the following conference series:

International Conference on Recent Trends in Image Processing and Pattern Recognition

Abstract

Emotion recognition plays an important role in making human-computer interaction more natural. Researchers have mainly followed two approaches: recognition from facial expressions and recognition from the tone of voice. The proposed work uses speech utterances in the Marathi language. Seven basic human emotions (angry, happy, disgust, surprise, sad, neutral, and fear) are considered in the experimental work. Marathi emotional words such as Gap re (गपरे), Are wa (अरेवा!), and Are Deva (अरेदेवा) are used as speech samples for feature extraction. The standard deviation and pitch of the voice were determined using PRAAT software. Three speech samples were used for the angry and neutral emotions, and four speech samples for the remaining emotions, i.e. happy, disgust, surprise, sad, and fear. Analysing the standard deviation feature values gave a recognition accuracy of 100% for the happy, disgust, and surprise emotions, 75% for sad and fear, and 66.66% for angry and neutral. The reported average recognition accuracy over the seven emotions is 90%.
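
The per-emotion figures in the abstract can be aggregated in more than one way, and the abstract does not say which one produced the overall figure. The following is a minimal sketch, not the authors' procedure. It assumes that the sample counts apply to each emotion separately (three each for angry and neutral, four each for the others) and that the stated rates correspond to 2 of 3, 3 of 4, and 4 of 4 correct recognitions. The dictionary and function names are illustrative only.

```python
# Sample counts per emotion, read from the abstract (assumed to be per emotion).
samples = {
    "angry": 3, "neutral": 3,
    "happy": 4, "disgust": 4, "surprise": 4, "sad": 4, "fear": 4,
}

# Correct recognitions implied by the stated rates (assumption):
# 66.66% of 3 -> 2, 75% of 4 -> 3, 100% of 4 -> 4.
correct = {
    "angry": 2, "neutral": 2,
    "happy": 4, "disgust": 4, "surprise": 4, "sad": 3, "fear": 3,
}


def per_emotion_rates(correct, samples):
    """Recognition rate in percent for each emotion."""
    return {e: 100.0 * correct[e] / samples[e] for e in samples}


def unweighted_average(rates):
    """Mean of the per-emotion rates; every emotion counts equally."""
    return sum(rates.values()) / len(rates)


def sample_weighted_average(correct, samples):
    """Overall rate in percent; every utterance counts equally."""
    return 100.0 * sum(correct.values()) / sum(samples.values())


rates = per_emotion_rates(correct, samples)
print(f"unweighted average:      {unweighted_average(rates):.2f}%")
print(f"sample-weighted average: {sample_weighted_average(correct, samples):.2f}%")
```

Under these assumptions the unweighted average is about 83.33% and the sample-weighted average about 84.62% (22 of 26 utterances). Neither equals the 90% given in the abstract, so the reported figure may rest on an aggregation or sample count that the abstract does not spell out.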

Author information

Authors and Affiliations

Punyashlok Ahilyadevi Holkar Solapur University, Solapur, India
Ashok R. Shinde & Shriram D. Raut
Sub-Centre Latur, Swami Ramanand Teerth University, Peth, India
Prashant P. Agnihotri
Dnyanopasak College, Parbhani, India
Prakash B. Khanale

Corresponding author

Correspondence to Ashok R. Shinde.

Editor information

Editors and Affiliations

University of South Dakota, Vermillion, SD, USA
K. C. Santosh
Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India
Bharti Gawali

About this paper

Cite this paper

Shinde, A.R., Raut, S.D., Agnihotri, P.P., Khanale, P.B. (2021). Emotion Recognition Using Standard Deviation and Pitch as a Feature in a Marathi Emotional Utterances. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1380. Springer, Singapore. https://doi.org/10.1007/978-981-16-0507-9_45

DOI: https://doi.org/10.1007/978-981-16-0507-9_45
Published: 26 February 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0506-2
Online ISBN: 978-981-16-0507-9
eBook Packages: Computer Science, Computer Science (R0)
