Skip to main content

Emotion Recognition Using Standard Deviation and Pitch as a Feature in a Marathi Emotional Utterances

  • Conference paper
  • First Online:
Recent Trends in Image Processing and Pattern Recognition (RTIP2R 2020)

Abstract

Emotion recognition plays a very important role to make the human-computer interaction more natural. Basically two approaches were used by various researchers i.e. by using facial expression and tone of the voice. In this proposed work speech utterances in Marathi language are used. Seven basic emotions of human beings like angry, happy, disgust, surprise, sad, neutral, and fear have been used in the experimental work. The Marathi emotional words like Gap re (गपरे), Are wa (अरेवा!), Are Deva (अरेदेवा) are used as speech samples for feature extraction. The standard deviation and pitch of voice were determined using PRAAT software. Three speech samples have been used angry and neutral emotion. The Four speech samples have been used for remaining emotions i.e. happy, disgust, surprise, sad and fear. By analysing the feature value of standard deviation 100% recognition accuracy rate obtained for happy, disgust and surprise emotion. 75% accuracy rate for sad & fear and 66.66% accuracy rate for angry and & neutral emotion. The average recognition accuracy rate of seven emotions is 90%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Vaishnav, S., Mitra, S.: Speech emotion recognition: a review. Int. J. Eng. Technol. (IRJET) 03(04) (2016)

    Google Scholar 

  2. Kerkeni, L., Serrestou, Y., Mbarki, M., Roof, K., Mahjoub, M.A.: Speech emotion recognition: methods and cases study. In: Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART 2018), vol. 2, pp. 175–182 (2018)

    Google Scholar 

  3. Santosh, K.C., Borra, S., Joshi, A., Dey, N.: Preface: special section: advances in speech, music and audio signal processing (articles 1–13). Int. J. Speech Technol. 22(2), 293–294 (2019)

    Article  Google Scholar 

  4. Desai, D.: Emotion recognition using speech signal: a review. Int. Res. J. Eng. Technol. (IRJET) 05(04) (2016)

    Google Scholar 

  5. Kwon, O.-W., Chan, K., Hao, J., Lee, T.-W.: Emotion recognition by speech signals. In: EUROSPEECH 2003 - INTERSPEECH 2003 8th European Conference on Speech Communication and Technology Geneva, Switzerland, 1–4 September 2003 (2003)

    Google Scholar 

  6. Lalitha, S., Madhavan, S., Bhushan, B., Saketh, S.: Speech emotion recognition. In: International Conference on Advances in Electronics, Computers and Communications (IJAECC) (2014)

    Google Scholar 

  7. Sato, N., Obuchi, Y.: Emotion recognition using mel-frequency cepstral coefficients. J. Nat. Lang. Process. 2, 835–848 (2007)

    Google Scholar 

  8. Zhang, Q., An, N., Wang, K., Ren, F., Li, L.: Speech emotion recognition using combination of features. In: 2013 Fourth International Conference on Intelligent Control and Information Processing (ICICIP), Beijing, China, 9–11 June 2013 (2013)

    Google Scholar 

  9. Ingale, A.B., Chaudhari, D.S.: Speech emotion recognition. Int. J. Soft Comput. Eng. (IJSCE) 2(1), 235–238 (2012). ISSN 2231-2307

    Google Scholar 

  10. El Ayadi, M., Kamel, M.S., Karray, F.: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn. 44, 572–587 (2011)

    Article  Google Scholar 

  11. Schuller, B., Rigoll, G., Lang, M.: Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine - belief network architecture. 0-7803-8484-©2004 IEEE

    Google Scholar 

  12. Mukherjee, H., Obaidullah, S.M., Santosh, K.C., Phadikar, S., Roy, K.: Line spectral frequency-based features and extreme learning machine for voice activity detection from audio signal. Int. J. Speech Technol. 21(4), 753–760 (2018). https://doi.org/10.1007/s10772-018-9525-6

    Article  Google Scholar 

  13. Gaikwad, S.K., Gawali, B.W., Yannawar, P.: A review on speech recognition technique. Int. J. Comput. Appl. 10(3), 16–24 (2010)

    Google Scholar 

  14. Yannawar, P.L., Manza, G.R., Gawali, B.W., Mehrotra, S.C.: Detection of redundant frame in audio visual speech recognition using low level analysis. In: 2010 IEEE International Conference on Computational Intelligence and Computing Research, Coimbatore, pp. 1–5 (2010). https://doi.org/10.1109/ICCIC.2010.5705746

  15. Gawali, B.W., et al.: Marathi isolated word recognition system using MFCC and DTW features. In: Proceedings of the International Conference on Advances in Computer Science, vol. 1 (2010)

    Google Scholar 

  16. Borde, P., Varpe, A., Manza, R., Yannawar, P.: Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition. Int. J. Speech Technol. 18(2), 167–175 (2014). https://doi.org/10.1007/s10772-014-9257-1

    Article  Google Scholar 

  17. Bordea, P., Varpeb, A., Manzac, R., Yannawara, P.: Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition (2014). arXiv preprint arXiv:1407.1165

  18. Satonkar Suhas, S., Kurhe Ajay, B., Prakash Khanale, B.: Face recognition using principal component analysis and linear discriminant analysis on holistic approach in facial images database. Int. Organ. Sci. Res. 2(12), 15–23 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ashok R. Shinde .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shinde, A.R., Raut, S.D., Agnihotri, P.P., Khanale, P.B. (2021). Emotion Recognition Using Standard Deviation and Pitch as a Feature in a Marathi Emotional Utterances. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1380. Springer, Singapore. https://doi.org/10.1007/978-981-16-0507-9_45

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-0507-9_45

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-0506-2

  • Online ISBN: 978-981-16-0507-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics