Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 264))

Abstract

In this paper, the authors have used 2-layer feed forward neural network for Hindi dialect recognition. A Dialect is a pattern of pronunciation of a language used by a community of native speakers belonging to the same geographical region. In this work, speech features have been explored to recognize four major dialects of Hindi. The dialects under consideration areKhariboli (spoken in West Uttar Pradesh, Delhi and some parts of Uttarakhand and Himachal Pradesh), Bhojpuri (spoken by population of East Uttar Pradesh, Bihar and Jharkhand), Haryanvi (spoken in Haryana, parts of Delhi, Uttar Pradesh and Uttarakhand) and Bagheli (spoken in Central India). Speech corpus for this work is collected from 15 speakers (including both male and female) from each dialect. The syllables of CVC structure is used as processing unit. Spectral features (MFCC) and prosodic features (duration and pitch contour) are extracted from speech for discriminating the dialects. Performance of the system is observed with spectral features and prosodic features as input. Results show that the system performs best when all the spectral and prosodic features are combined together to form input feature set during network training. The dialect recognition system shows a recognition score of 79% with these input features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Liu, M., Xu, B., Hunng, T., Deng, Y., Li, C.: Mandarin accent adaptation based on context independent/context dependent pronunciation modeling. In: Proceedings of The Acoustic, Speech and Signal Processing, ICASSP 2000, Washington DC, USA, pp. 1025–1028 (2000)

    Google Scholar 

  2. Behravan, H.: Dialect and accent recognition. Dissertation, University of Eastern Finland (2012)

    Google Scholar 

  3. Mishra, D., Bali, K.: A comparative phonological study of the dialects of Hindi. In: Proceedings of ICPhS XVII, Hong Kong, pp. 17–21 (2011)

    Google Scholar 

  4. Zue, W., Hazen, T.J.: Automatic language identification using segment based approach. In: Proceedings of Eurospeech, pp. 1303–1306 (1993)

    Google Scholar 

  5. Rao, K.S., Nandy, S., Koolagudi, S.G.: Identification of Hindi dialect using speech. In: Proceedings of WMSCI 2010- the 14th World Multi-Conference on Systemics, Cybernetics and Informatics, Orlando, Florida, USA (2010)

    Google Scholar 

  6. Mehrabani, M., Boril, H., Hansen, J.H.L.: Dialect distance assessment method based on comparision of pitch pattern statistical models. In: Proceedings of ICASSP, Dallas, USA, pp. 5158–5161 (2010)

    Google Scholar 

  7. Lee, H., Seong, C.J.: Experimental phonetic study of the syllable duration of Korean with respect to the positional effect. In: Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP), Philadelphia, pp. 1193–1196 (1996)

    Google Scholar 

  8. Sinha, S., Agrawal, S.S., Jain, A.: Dialectal influences on acoustic duration of Hindi phonemes. In: Proceedings of Oriental-COCOSDA-2013, Gurgaon, India (2013)

    Google Scholar 

  9. Aggarwal, R.K., Dave, M.: Integration of multiple acoustic and language models for improved Hindi speech recognition system. International Journal of Speech Technology (IJST) 15, 165–180 (2012)

    Article  Google Scholar 

  10. Haykin, S.: Neural Networks: A comprehensive foundation. Pearson Education Asia, Inc., New Delhi (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shweta Sinha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Sinha, S., Jain, A., Agrawal, S.S. (2014). Speech Processing for Hindi Dialect Recognition. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-04960-1_14

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-04959-5

  • Online ISBN: 978-3-319-04960-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics