Skip to main content

Automatic Speech Processing of Marathi Speaker İdentification for Isolated Words System

  • Conference paper
  • First Online:
Recent Trends in Image Processing and Pattern Recognition (RTIP2R 2020)

Abstract

In the prevailing period of innovation the automatic identification of speaker assumes a significant role. The application of speaker recognition is spin towards Biometric security. This paper depicted a speaker recognition framework for isolated word dataset. The feature extraction has been finished utilizing Mel Frequency Cepstral Coefficient (MFCC) techniques. The database of the research is structure and creates utilizing 25 Male and 25 Female speakers. The size of dataset is 2500 isolated words. The content for the dataset recording is chosen based on vowel letters in order. The execution of the framework is determined utilizing False Rejection Rate (FRR), False Acceptance Rate (FAR). The precision of the Speaker Recognition rate for Male is better as compare with the exactness of female. This structure is utilized for speaker recognition framework for the confined word distinguishing proof framework by applying highlight extraction methods as MFCC and arrangement is finished with Euclidian Distance. We got a normal exactness for Male rate is 85% and 81% for female. The exhibition of the haphazardly chosen subject gathering was 79%. This is the general precision pace of Speaker Recognition framework for Marathi Isolated Words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Torfi, A., Nasrabadi, N.M., Dawson, J.: Text-independent speaker verification using 3D convolutional neural networks. arXiv preprint arXiv:1705.09422 (2017)

  2. Furui, S.: Speaker independent isolated word recognition using dynamic features of speech spectrum. IEEE Trans. Acoust. Speech Signal Process. ASSP 34(1), 2–59 (1986)

    Google Scholar 

  3. Gaikwad, B.P., Kamble, P.S.: Speech processing for secluded Marathi words recognition using MFCC features. Int. J. Innov. Res. Comput. Commun. Eng. (IJIRCCE) 3(8), (2015). ISSN (Online): 2320-9801, ISSN (Print): 2320-9798

    Google Scholar 

  4. https://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers.html. Accessed 1 June 2014

  5. Kamble, V.V., Gaikwad, B.P., Rana, D.M.: Spontaneous emotion recognition for Marathi spoken words. In: International Conference on Communication and Signal Processing, India, 3–5 April 2014, 978-1-4799-3356-3/14/$31.00 © IEEE (2014)

    Google Scholar 

  6. Shrawankar, U., Thakare, V.: Techniques for feature extraction in speech recognition system: a comparative study. Int. J. Comput. Appl. Eng. Technol. Sci. (IJCAETS) 412–418 (2010). ISSN 0974-3596

    Google Scholar 

  7. Furui, S.: An overview of speaker recognition technology. In: ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, pp. 1–9 (1994)

    Google Scholar 

  8. Anusuya, M.A., Katti, S.K.: Speech recognition by machine: a review. Int. J. Comput. Sci. Inf. Secur. (IJCSIS) 6(3), 181–205 (2009)

    Google Scholar 

  9. Huang, X.D.: A study on speaker - adaptive speech recognition. In: Proceedings DARPA Workshop on Speech and Natural Language, pp. 278–283 (1991)

    Google Scholar 

  10. Rabiner, L.R., Schafer, R.W.: Digital Processing of Speech Signals, Signal Processing. Prentice-Hall, Englewood Cliffs (1978)

    Google Scholar 

  11. Singh, B., Kaur, R., Devgun, N., Kaur, R.: The process of feature extraction in automatic speech recognition system for computer machine interaction with humans: a review. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2(2), 1–7 (2012)

    Google Scholar 

  12. Tiwari, V.: MFCC and its application in speaker recognition. Int. J. Emerg. Technol. 1(1), 19–22 (2010)

    Google Scholar 

  13. Levinson, S.E.: Structural methods in automatic speech recognition. Proc. IEEE 73(11), 1625–1650 (1985)

    Article  Google Scholar 

  14. Srinivasa Kumar, Ch., Mallikarjuna Rao, P.: Int. J. Comput. Sci. Eng. 3(8), 2942–2954 (2011)

    Google Scholar 

  15. Muda, L., Begam, M., Elamvazuthi, I.: Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. J. Comput. 2(3) (2010). ISSN 2151-9617

    Google Scholar 

  16. Huang, C., Chang, E., Chen, T.: Accent issues in large vocabulary continuous speech recognition (LVCSR). Microsoft Research China, MSR-TR-2001-69, pp. 1–27 (2001)

    Google Scholar 

  17. Bachate, R.P., Sharma, A.: Automatic speech recognition systems for regional languages in India. Int. J. Recent Technol. Eng. (IJRTE) 8(2S3) (2019). ISSN 2277-3878

    Google Scholar 

  18. Gawali, B.W., Gaikwad, S., Yannawar, P., Mehrotra, S.C.: marathi isolated word recognition system using MFCC and DTW features. In: Proceedings of International Conference on Advances in Computer Science, vol. 1 (2010)

    Google Scholar 

  19. Gaikwad, S., Gawali, B., Yannawar, P., Mehrotra, S.: Feature extraction using fusion MFCC for continuous marathi speech recognition. In: 2011 Annual IEEE India Conference. IEEE, 16 December 2011

    Google Scholar 

  20. Gaikwad, S.K., Gawali, B.W., Yannawar, P.: A review on speech recognition technique. Int. J. Comput. Appl. 10(3) (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pawan Kamble .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kamble, P. et al. (2021). Automatic Speech Processing of Marathi Speaker İdentification for Isolated Words System. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1381. Springer, Singapore. https://doi.org/10.1007/978-981-16-0493-5_30

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-0493-5_30

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-0492-8

  • Online ISBN: 978-981-16-0493-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics