Abstract
In this paper, the authors use a 2-layer feed-forward neural network for Hindi dialect recognition. A dialect is a pattern of pronunciation of a language used by a community of native speakers belonging to the same geographical region. In this work, speech features are explored to recognize four major dialects of Hindi. The dialects under consideration are Khariboli (spoken in West Uttar Pradesh, Delhi and some parts of Uttarakhand and Himachal Pradesh), Bhojpuri (spoken by the population of East Uttar Pradesh, Bihar and Jharkhand), Haryanvi (spoken in Haryana and parts of Delhi, Uttar Pradesh and Uttarakhand) and Bagheli (spoken in Central India). The speech corpus for this work was collected from 15 speakers (both male and female) of each dialect. Syllables with CVC structure are used as the processing unit. Spectral features (MFCC) and prosodic features (duration and pitch contour) are extracted from speech to discriminate the dialects. The performance of the system is observed with spectral features and prosodic features as input. Results show that the system performs best when all the spectral and prosodic features are combined to form the input feature set during network training. With these input features, the dialect recognition system achieves a recognition score of 79%.
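The feature combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no layer sizes, activation functions, or feature dimensions, so the 13 MFCC coefficients, the 5-point pitch contour, the 10 hidden units, the tanh and softmax activations, and the reading of "2-layer" as one hidden layer plus one output layer are all assumptions made for illustration. The weights are random and untrained, and the input syllable is placeholder data, so the predicted label carries no meaning.

```python
import math
import random

DIALECTS = ["Khariboli", "Bhojpuri", "Haryanvi", "Bagheli"]

def combine_features(mfcc, duration, pitch_contour):
    """Concatenate spectral (MFCC) and prosodic (duration, pitch) features."""
    return list(mfcc) + [duration] + list(pitch_contour)

def init_layer(n_in, n_out, rng):
    """Return small random weights and zero biases for one dense layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def dense(x, weights, biases):
    """Compute the affine map W x + b for one layer."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]

def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def forward(x, hidden, output):
    """Forward pass: tanh hidden layer, then softmax over the four dialects."""
    h = [math.tanh(v) for v in dense(x, *hidden)]
    return softmax(dense(h, *output))

rng = random.Random(0)

# Assumed sizes; the abstract does not state them.
N_MFCC, N_PITCH, N_HIDDEN = 13, 5, 10
hidden = init_layer(N_MFCC + 1 + N_PITCH, N_HIDDEN, rng)
output = init_layer(N_HIDDEN, len(DIALECTS), rng)

# Placeholder features for one CVC syllable (not real measurements).
mfcc = [rng.gauss(0.0, 1.0) for _ in range(N_MFCC)]
duration = 0.25                          # assumed to be in seconds
pitch = [0.1, 0.3, 0.5, 0.2, -0.1]       # assumed normalized pitch contour

x = combine_features(mfcc, duration, pitch)
probs = forward(x, hidden, output)
predicted = DIALECTS[probs.index(max(probs))]
print(predicted, [round(p, 3) for p in probs])
```

Dropping the MFCC part or the prosodic part from `combine_features` (and shrinking the input size of the hidden layer to match) gives the spectral-only and prosodic-only configurations that the abstract compares against the combined feature set.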
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Sinha, S., Jain, A., Agrawal, S.S. (2014). Speech Processing for Hindi Dialect Recognition. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_14
DOI: https://doi.org/10.1007/978-3-319-04960-1_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04959-5
Online ISBN: 978-3-319-04960-1
eBook Packages: Engineering, Engineering (R0)