Skip to main content

Language Identification System for the Tatar Language

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8113))

Abstract

This paper describes a speech identification system for the Tatar, English and Russian languages. It also presents a newly created Tatar speech corpus, which is used for building a language model. The main idea is to investigate the potential of basic phonotactic approaches (i.e. PRLM-approach) when working with the Tatar language. The results indicate that the proposed system can be successfully employed for identifying the Tatar, English and Russian languages.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zissman, M.: Comparison of four approaches to automatic language identification of telephone speech. IEEE Transactions on Speech and Audio Processing 4 (1996)

    Google Scholar 

  2. Zissman, M., Singer, E.: Automatic language identification of telephone speech messages using phoneme recognition and n-gram modeling. In: Proc. ZCASSP 1994, vol. 1, pp. 305–308 (1994)

    Google Scholar 

  3. Lopes, C., Perdigao, F.: Phone recognition on TIMIT database. Speech Technologies, 285–302 (2011)

    Google Scholar 

  4. Young, S.: The HTK book (for HTK version 3.4) (2009)

    Google Scholar 

  5. Niesler, T., Willett, D.: Language identification and multilingual speech recognition using discriminatively trained acoustic models. In: ISCA Workshop on Multilingual, Speech and Language Processing, Stellenbosch, South Africa (2006)

    Google Scholar 

  6. Khusainov, A., Suleymanov, D.: Speech analysis platform prototype for Tatar language. Open Semantic Technologies for Intelligent Systems, Minsk, Belarus (2013)

    Google Scholar 

  7. Khusainov, A.: An overview of speech recognition approaches. In: Proceedings of the 14th International Conference “Speech and Computer”, Kazan, Russia (2011)

    Google Scholar 

  8. Karpov, A., Kipyatkova, I., Ronzhin, A.: Very Large Vocabulary ASR for Spoken Russian with Syntactic and Morphemic Analysis. In: Proceedings INTERSPEECH 2011 International Conference, ISCA Association, Florence, Italy, pp. 3161–3164 (2011)

    Google Scholar 

  9. Karpov, A., Kipyatkova, I., Ronzhin, A.: Speech Recognition for East Slavic Languages: The Case of Russian. In: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, SLTU 2012, Cape Town, RSA, pp. 84–89 (2012)

    Google Scholar 

  10. Matejka, P.: Phonotactic Language Identification using High Quality Phoneme Recognition. In: Proc. Eurospeech, Portugal (2005)

    Google Scholar 

  11. Martin, A., Le, A.: The Current State of Language Recognition: NIST 2005 Evaluation Results. In: IEEE Odyssey 2006, Puerto Rico (2006)

    Google Scholar 

  12. Gauvain, J., Messaoudi, A., Schwenk, H.: Language Recognition using Phone Lattices. In: Proc. ICSLP 2004 (2004)

    Google Scholar 

  13. Wong, K.-K., Siu, M.-H.: Automatic language identification using discrete hidden markov model. In: Proc. ICSLP, Jeju, Korea (2004)

    Google Scholar 

  14. Reynolds, D., Campbell, W., Shen, W., Singer, E.: Automatic Language Recognition Via Spectral and Token Based Approaches. Springer Handbook of Speech Processing, ch. 41, pp. 811–824 (2008)

    Google Scholar 

  15. Yonghong, Y., Barnard, E., Vermeulen, P.: Development of an Approach to Language Identification Based on Language-dependent Phone Recognition (1995)

    Google Scholar 

  16. Li, H., Ma, B., Lee, K.A.: Spoken Language Recognition: From Fundamentals to Practice. Proceedings of the IEEE 101(5), 1136–1159 (2013)

    Article  Google Scholar 

  17. Kirchhoff, K., Schultz, T.: Language characteristics. In: Multilingual Speech Processing, Amsterdam, The Netherlands (2006)

    Google Scholar 

  18. Ambikairajah, E., Li, H., Wang, L., Yin, B., Sethu, V.: Language identification: A tutorial. IEEE Circuits Syst. Mag. 11(2), 82–108 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Khusainov, A., Suleymanov, D. (2013). Language Identification System for the Tatar Language. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01931-4_27

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01930-7

  • Online ISBN: 978-3-319-01931-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics