Speech Processing for Hindi Dialect Recognition

Sinha, Shweta; Jain, Aruna; Agrawal, Shyam S.

doi:10.1007/978-3-319-04960-1_14

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 264)

Abstract

In this paper, the authors use a 2-layer feed-forward neural network for Hindi dialect recognition. A dialect is a pattern of pronunciation of a language used by a community of native speakers belonging to the same geographical region. In this work, speech features are explored to recognize four major dialects of Hindi. The dialects under consideration are Khariboli (spoken in West Uttar Pradesh, Delhi and some parts of Uttarakhand and Himachal Pradesh), Bhojpuri (spoken by the population of East Uttar Pradesh, Bihar and Jharkhand), Haryanvi (spoken in Haryana, parts of Delhi, Uttar Pradesh and Uttarakhand) and Bagheli (spoken in Central India). The speech corpus for this work was collected from 15 speakers (both male and female) of each dialect. Syllables of CVC structure are used as the processing unit. Spectral features (MFCC) and prosodic features (duration and pitch contour) are extracted from speech to discriminate between the dialects. System performance is observed with spectral features and prosodic features as input. Results show that the system performs best when all the spectral and prosodic features are combined to form the input feature set during network training. With these input features, the dialect recognition system achieves a recognition score of 79%.
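
The pipeline described in the abstract (combine spectral and prosodic features, then feed them to a 2-layer feed-forward network that scores four dialects) can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the abstract gives no feature dimensions, layer sizes, or activation functions, so the 13 MFCCs, the 5-point pitch contour, the 20 hidden units, the tanh and softmax choices, and all helper names are assumptions. "2-layer" is read here as one hidden layer plus one output layer, which is also an assumption. The weights are random and untrained, so the prediction is meaningless; the block shows only the data flow.

```python
import math
import random

DIALECTS = ["Khariboli", "Bhojpuri", "Haryanvi", "Bagheli"]


def combine_features(mfcc, duration, pitch_contour):
    """Concatenate spectral (MFCC) and prosodic features into one input vector."""
    return list(mfcc) + [duration] + list(pitch_contour)


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases for one fully connected layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, weights, biases):
    """Affine transform: one output per row of the weight matrix."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def forward(x, hidden, output):
    """Two-layer forward pass: tanh hidden layer, softmax output layer."""
    h = [math.tanh(v) for v in dense(x, *hidden)]
    return softmax(dense(h, *output))


rng = random.Random(0)

# Hypothetical features for one CVC syllable (all values are made up).
mfcc = [rng.gauss(0.0, 1.0) for _ in range(13)]   # assumed 13 coefficients
duration = 0.25                                   # syllable duration in seconds
pitch_contour = [0.180, 0.185, 0.190, 0.186, 0.178]  # pitch samples in kHz

x = combine_features(mfcc, duration, pitch_contour)
hidden = init_layer(len(x), 20, rng)              # assumed 20 hidden units
output = init_layer(20, len(DIALECTS), rng)

probs = forward(x, hidden, output)
predicted = DIALECTS[probs.index(max(probs))]
print(predicted, [round(p, 3) for p in probs])
```

In a real system the features would be extracted from recorded syllables and the weights learned by training on the labelled corpus; the spectral-only and prosodic-only conditions mentioned in the abstract would correspond to passing only one group of features as the input vector.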

Author information

Authors and Affiliations

Birla Institute of Technology, Mesra, Ranchi, India
Shweta Sinha & Aruna Jain
KIIT College of Engineering, Gurgaon, India
Shyam S. Agrawal

Corresponding author

Correspondence to Shweta Sinha.

Editor information

Editors and Affiliations

Technopark Campus, Indian Institute of Information Technology and Management – Kerala (IIITM-K), Trivandrum, Kerala, India
Sabu M. Thampi
Center for Computing Research, National Polytechnic Institute, Mexico City, Mexico
Alexander Gelbukh
Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur, India
Jayanta Mukhopadhyay

Cite this paper

Sinha, S., Jain, A., Agrawal, S.S. (2014). Speech Processing for Hindi Dialect Recognition. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_14

DOI: https://doi.org/10.1007/978-3-319-04960-1_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04959-5
Online ISBN: 978-3-319-04960-1
eBook Packages: Engineering, Engineering (R0)