Skip to main content

Generating Term Transliterations Using Contextual Information and Validating Generated Results Using Web Corpora

  • Conference paper
Information Retrieval Technology (AIRS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3689))

Included in the following conference series:

  • 988 Accesses

Abstract

Transliterating foreign entities into Chinese is usually done through direct-style approach. The direct-style approach transliterates each syllable rendered from foreign terms into Chinese directly. Not every syllable can be rendered. An approach utilizing contextual information for term transliteration is proposed in this paper to attack this problem. Traditionally, evaluating transliteration performances always uses character and word error rates. However, many transliteration variants of the same term always found. Validating the generated results using Web corpora is more suitable and is proposed in this paper. Using the proposed evaluation method, experiments on term transliteration were conducted. From the experimental results show that taking contextual information is helpful to term transliteration and validating the generated results using Web corpora can provide more concrete evidences to transliteration evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Gao, W., Wong, K.F., Lam, W.: Phoneme-based Transliteration of Foreign Names for OOV Problem. In: Proceedings of 1st International Joint Chinese Natural Language Processing, Hainan, China, pp. 374–381 (2004)

    Google Scholar 

  • Jurafsky, D., Martin, J.H.: Speech and Language Processing, pp. 102–120. Prentice-Hall, Englewood Cliffs (2000)

    Google Scholar 

  • Knight, K., Graehl, J.: Machine Transliteration. Computational Linguistics 24(4), 599–612 (1998)

    Google Scholar 

  • Kuo, J.S., Yang, Y.K.: Incorporating Pronunciation Variation into Extraction of Transliterated-term Pairs from Web Corpora. In: Proceedings of International Conference on Chinese Computing, Singapore, pp. 131–138 (2005)

    Google Scholar 

  • Li, H., Zhang, M., Su, J.: A Joint Source-Channel Model for Machine Transliteration. In: Proceedings of 42nd ACL, Barcelona, Spain, pp. 159–166 (2004)

    Google Scholar 

  • Meng, H., Lo, W.K., Chen, B., Tang, K.: Generating Phonetic Cognates to Handle Named Entities in English-Chinese Cross-Language Spoken Document Retrieval. In: Proceedings of the Automatic Speech Recognition and Understanding Workshop, pp. 311–314 (2001)

    Google Scholar 

  • Virga, P., Khudanpur, S.: Transliteration of Proper Names in Cross-Lingual Information Retrieval. In: Proceedings of 41st ACL Workshop on Multilingual and Mixed Language Named Entity Recognition, pp. 57–64 (2003)

    Google Scholar 

  • Wan, S., Verspoor, C.M.: Automatic English-Chinese Name Transliteration for Development of Multilingual Resources. In: Proceedings of 17th COLING and 36th ACL, Montreal Quebec, Canada, pp. 1352–1356 (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kuo, JS. (2005). Generating Term Transliterations Using Contextual Information and Validating Generated Results Using Web Corpora. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_64

Download citation

  • DOI: https://doi.org/10.1007/11562382_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29186-2

  • Online ISBN: 978-3-540-32001-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics