Abstract
Language identification (LID) based on phonotactic modeling is presented in this paper. Approaches using phoneme strings and strings of units automatically derived by an Ergodic HMM (EHMM) are compared. The phoneme recognizers were trained on 6 languages from OGI multi-language-corpus and Czech SpeechDat-E. The LID results are obtained on 4 languages. The results show superiority of Czech phoneme recognizer while used in LID and promising trends using the EHMM-derived units.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Schwarz, P., Matějka, P., Cěrnocký, J.: Recognition of Phoneme Strings using TRAP Technique. In: Proc. EuroSpeech 2003, September 2003, pp. 825–828 (2003)
Sharma, S., Ellis, D., Karajekar, S., Jain, P., Hermansky, H.: Feature extraction using non-linear transformation for robust speech recognition on the Aurora database. In: Proc. ICASSP 2000, Turkey (2000)
Hermansky, H., Sharma, S.: Temporal Patterns (TRAPS) in ASR of Noisy Speech. In: Proc. ICASSP 1999, Phoenix, Arizona, USA (March 1999)
Schwarz, P., Matějka, P., Černocký, J.: Towards Lower Error Rates in Phoneme Recognition. In: Sojka, P., et al. (eds.) Text, Speech and Dialogue, Proceedings of the Seventh International Conference, Brno, Czech Republic, September 8-11, p. 221 (2004)
Szöke, I., Cěrnocký, J.: Speech Units Automatically Generated by Ergodic Hidden Markov Model. Submitted to EEICT (2004)
Cěrnocký, J., Baudoin, G., Chollet, G.: Segmental vocoder - going beyond the phonetic approach. In: Proc. IEEE ICASSP 1998, May 1998, pp. 605–608 (1998)
The SPRACHcore software packages, http://www.icsi.berkeley.edu/~dpwe/projects/sprach
HTK toolkit, http://htk.eng.cam.ac.uk
OGI MultiLanguage Telephone Speech (January 2004), http://www.cslu.ogi.edu/corpora/mlts/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matějka, P., Szöke, I., Schwarz, P., Černocký, J. (2004). Automatic Language Identification Using Phoneme and Automatically Derived Unit Strings. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_19
Download citation
DOI: https://doi.org/10.1007/978-3-540-30120-2_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23049-6
Online ISBN: 978-3-540-30120-2
eBook Packages: Springer Book Archive