Abstract
Transliteration is the process of replacing the characters of one language with the phonetically equivalent characters of another. India is a linguistically diverse country where people speak and understand many languages but often do not know the scripts of some of them. Transliteration plays a major role in such cases. It has also been a supporting tool in machine translation and cross-language information retrieval systems, since most proper nouns are out-of-vocabulary words. In this paper, a sequence learning method for transliterating named entities from Tamil to Hindi is proposed. The accuracy obtained with this approach is encouraging. In future, the transliteration system can be embedded in a Tamil-to-Hindi machine translation system.
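To illustrate what treating transliteration as a sequence learning problem can look like, the following is a minimal sketch and not the method of the paper. It assumes that a name can be split into single characters and that source and target are already aligned one-to-one, which holds only for simple names. The helper names (`window_features`, `labelled_examples`), the padding symbol, and the context window size are hypothetical choices made for illustration. Each Tamil character becomes one training example whose features are the character and its neighbours and whose label is the corresponding Hindi character; a sequence labeller could then be trained on such pairs.

```python
from typing import Dict, List, Tuple

PAD = "_"  # hypothetical padding symbol for positions outside the word


def window_features(units: List[str], i: int, size: int = 1) -> Dict[str, str]:
    """Describe source unit i by itself and its neighbours within `size`."""
    feats = {}
    for offset in range(-size, size + 1):
        j = i + offset
        feats[f"u[{offset:+d}]"] = units[j] if 0 <= j < len(units) else PAD
    return feats


def labelled_examples(source: str, target: str) -> List[Tuple[Dict[str, str], str]]:
    """Pair each source character with its target character as the label.

    Assumption: the two strings are pre-aligned one-to-one, character by
    character. Real names usually need a proper segmentation and alignment step.
    """
    src, tgt = list(source), list(target)
    if len(src) != len(tgt):
        raise ValueError("this sketch needs equal-length, pre-aligned strings")
    return [(window_features(src, i), tgt[i]) for i in range(len(src))]


# Toy pair: the name "Kamala" written in Tamil and in Hindi (Devanagari).
examples = labelled_examples("கமலா", "कमला")
for feats, label in examples:
    print(feats, "->", label)
```

The context window lets a learner choose a target character that depends on neighbouring source characters, which is the reason for framing the task as sequence labelling and not as a fixed character table.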
Copyright information
© 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Keerthana, S., Dhanalakshmi, V., Anand Kumar, M., Ajith, V.P., Soman, K.P. (2012). Tamil to Hindi Machine Transliteration Using Support Vector Machines. In: Das, V.V., Ariwa, E., Rahayu, S.B. (eds) Signal Processing and Information Technology. SPIT 2011. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 62. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32573-1_44
DOI: https://doi.org/10.1007/978-3-642-32573-1_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32572-4
Online ISBN: 978-3-642-32573-1
eBook Packages: Computer Science (R0)