Abstract
This paper describes a language identification system for Tatar, English and Russian speech. It also presents a newly created Tatar speech corpus, which is used to build a language model. The main aim is to investigate the potential of basic phonotactic approaches, specifically PRLM (phone recognition followed by language modeling), when working with the Tatar language. The results indicate that the proposed system can successfully identify the Tatar, English and Russian languages.
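As a rough illustration of the phonotactic idea behind PRLM (phone recognition followed by language modeling), the sketch below scores a decoded phone sequence against one n-gram model per language and picks the best-scoring language. It is not the authors' implementation. It assumes a phone recognizer has already produced the phone strings, and the phone labels, the language names `lang_a` and `lang_b`, the bigram order and the add-one smoothing are all illustrative assumptions.

```python
import math
from collections import Counter


def train_bigram_model(phone_sequences):
    """Collect bigram statistics over phone sequences of one language."""
    context_counts, bigram_counts, vocab = Counter(), Counter(), set()
    for seq in phone_sequences:
        padded = ["<s>"] + list(seq) + ["</s>"]
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            context_counts[prev] += 1          # how often prev starts a bigram
            bigram_counts[(prev, cur)] += 1
    return {"contexts": context_counts, "bigrams": bigram_counts, "vocab": vocab}


def log_likelihood(model, seq, vocab_size):
    """Log-probability of a phone sequence under an add-one smoothed bigram model."""
    padded = ["<s>"] + list(seq) + ["</s>"]
    total = 0.0
    for prev, cur in zip(padded, padded[1:]):
        numerator = model["bigrams"][(prev, cur)] + 1
        denominator = model["contexts"][prev] + vocab_size
        total += math.log(numerator / denominator)
    return total


def identify_language(models, seq):
    """Return the best-scoring language and the per-language scores."""
    # Shared vocabulary size so that scores are comparable across languages.
    vocab_size = len(set().union(*(m["vocab"] for m in models.values())))
    scores = {lang: log_likelihood(m, seq, vocab_size) for lang, m in models.items()}
    return max(scores, key=scores.get), scores


# Toy phone strings (hypothetical labels, not taken from any corpus).
toy_training = {
    "lang_a": [["a", "b", "a", "b"], ["a", "b", "b"]],
    "lang_b": [["c", "d", "c", "d"], ["d", "c", "d"]],
}
toy_models = {lang: train_bigram_model(seqs) for lang, seqs in toy_training.items()}
best_language, language_scores = identify_language(toy_models, ["a", "b", "a"])
```

In a real system the phone sequences would come from an acoustic phone recognizer, and the n-gram order and smoothing method would be tuned on held-out data.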
© 2013 Springer International Publishing Switzerland
Khusainov, A., Suleymanov, D. (2013). Language Identification System for the Tatar Language. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science, vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_27
DOI: https://doi.org/10.1007/978-3-319-01931-4_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer Science, Computer Science (R0)