Abstract
Language transliteration is one of the important area in natural language processing. Machine Transliteration is the conversion of a character or word from one language to another without losing its phonological characteristics. It is an orthographical and phonetic converting process. Therefore, both grapheme and phoneme information should be considered. Accurate transliteration of named entities plays an important role in the performance of machine translation and cross-language information retrieval processes. The transliteration model must be design in such a way that the phonetic structure of words should be preserve as closely as possible. This paper address the problem of transliterating English to Kannada language using a publically available translation tool called Statistical Machine Translation (SMT).This transliteration technique was demonstrated for English to Kannada Transliteration and achieved exact Kannada transliterations for 89.27% of English names. The result of proposed model is compared with the SVM based transliteration system as well as Google Indic transliteration system.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ganesh, S., Harsha, S., Pingali, P., Varma, V.: Statistical Transliteration for Cross Langauge Information Retrieval using HMM alignment and CRF. In: Proceedings of the 2nd workshop on Cross Lingual Information Access (CLIA). IIIT Hyderabad, India (2008)
Vijaya, M.S., Loganathan, R., Shivapratap, G., Ajith, V.P., Soman, K.P.: In: International Conference on Asian Language Processing, Thailand (November 2008)
Giménezand, J., Màrquez, L.: SVMTtool: Technical manual v1.3 (August 2006)
Vapnik, V.N.: Statistical Learning Theory. J.Wiley & Sons, Inc., New York (1998)
Cortes, C., Haffner, P., Mohri, M.: A Machine Learning Framework For Spoken-Dialog Classification: Springer Handbook on Speech Processing and Speech Communication (2008)
Ramanathan, A.: Statistical Machine Translation.: Ph.D. Seminar Report. IIT-Bombay, India (2008)
Jurafsky, D., Martin, J.H.: Speech and Language Processing- An Introduction to Natural Language Processing. In: Computational Linguistics and Speech Recognition, pp. 799–801. Prentice Hall, Englewood Cliffs (2000)
Koehn, P.: MOSES a Beam-Search Decoder for Factored Phrase-Based Statistical Machine Translation Models User Manual and Code Guide. University of Edinburg, UK (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Antony, P.J., Ajith, V.P., Soman, K.P. (2010). Statistical Method for English to Kannada Transliteration. In: Das, V.V., et al. Information Processing and Management. BAIP 2010. Communications in Computer and Information Science, vol 70. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12214-9_57
Download citation
DOI: https://doi.org/10.1007/978-3-642-12214-9_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12213-2
Online ISBN: 978-3-642-12214-9
eBook Packages: Computer ScienceComputer Science (R0)