Tamil to Hindi Machine Transliteration Using Support Vector Machines

  • Conference paper
Signal Processing and Information Technology (SPIT 2011)

Abstract

Transliteration is the process of replacing the characters of one language with the phonetically equivalent characters of another. India is a linguistically diverse country where people speak and understand many languages but often do not know the scripts of some of them, and transliteration plays a major role in such cases. It has also been a supporting tool in machine translation and cross-language information retrieval systems, since most proper nouns are out-of-vocabulary words. In this paper, a sequence learning method for transliterating named entities from Tamil to Hindi is proposed. The accuracy obtained with this approach is encouraging. In future, this transliteration system can be embedded in a Tamil to Hindi machine translation system.
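To make the sequence learning framing concrete, the sketch below shows one way a Tamil/Hindi name pair could be turned into per-unit training rows for a classifier such as an SVM. It is only an illustration: the abstract does not describe the paper's segmentation, alignment, or feature set, so the one-to-one code point alignment, the context window size, and the names `window_features`, `training_rows`, and `PAD` are all assumptions made here.

```python
from typing import Dict, List, Tuple

PAD = "_"  # hypothetical padding symbol for positions outside the word


def window_features(units: List[str], i: int, size: int = 2) -> Dict[str, str]:
    """Describe unit i by itself and its neighbours within `size` positions."""
    feats: Dict[str, str] = {}
    for offset in range(-size, size + 1):
        j = i + offset
        feats[f"u[{offset:+d}]"] = units[j] if 0 <= j < len(units) else PAD
    return feats


def training_rows(source: str, target: str) -> List[Tuple[Dict[str, str], str]]:
    """Pair each source unit's context features with its target unit (the label).

    Assumption: source and target align one-to-one by code point. Real name
    pairs often do not, and would need an explicit alignment step first.
    """
    src, tgt = list(source), list(target)
    if len(src) != len(tgt):
        raise ValueError("this sketch assumes a one-to-one alignment")
    return [(window_features(src, i), tgt[i]) for i in range(len(src))]


# Toy pair: the name "Kamala" in Tamil and Devanagari script.
rows = training_rows("கமலா", "कमला")
for feats, label in rows:
    print(feats["u[+0]"], "->", label)
```

Each row pairs a feature dictionary with a target label, so transliterating a new name becomes a matter of predicting one label per source unit from its context. Any multi-class sequence tagger could consume rows of this shape.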

Copyright information

© 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Keerthana, S., Dhanalakshmi, V., Anand Kumar, M., Ajith, V.P., Soman, K.P. (2012). Tamil to Hindi Machine Transliteration Using Support Vector Machines. In: Das, V.V., Ariwa, E., Rahayu, S.B. (eds) Signal Processing and Information Technology. SPIT 2011. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 62. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32573-1_44

  • DOI: https://doi.org/10.1007/978-3-642-32573-1_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32572-4

  • Online ISBN: 978-3-642-32573-1

  • eBook Packages: Computer Science, Computer Science (R0)
