Skip to main content

The English Unknown Term Translation Mining with Improved Bilingual Snippets Collection Strategy

  • Conference paper
Intelligent Computing Theories and Applications (ICIC 2012)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7390))

Included in the following conference series:

  • 2608 Accesses

Abstract

Traditional bilingual snippets retrieval method is to select the top N snippets returned from web search engine as bilingual corpora. In this paper, an improved bilingual snippets retrieval method is proposed. It combines term expansion, the surface pattern matching model and top N bilingual snippets selection to retrieve bilingual snippets. More relative bilingual snippets can be found by using term expansion, and surface pattern matching is useful to select bilingual snippets according to the constitution of unknown term translation. The top 100 bilingual snippets and those satisfy the surface pattern matching model serve as the last bilingual corpora for term translation mining. Experimental results show that the improved bilingual snippets retrieving method improves the top 100 inclusion rate by 2.3% than baseline system, which verified that the improved bilingual snippets retrieving method is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ballesteros, L., Croft, W.B.: Phrasal Translation and Query Expansion Techniques for Cross-Language Information Retrieval. In: Proc. of 20th International ACM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, USA, pp. 84–91 (1997)

    Google Scholar 

  2. Gamallo Otero, P.: Using Natural Alignment to Extract Translation Equivalents. In: Vieira, R., Quaresma, P., das Graças Volpe Nunes, M., Mamede, N.J., Oliveira, C., Dias, M.C. (eds.) PROPOR 2006. LNCS (LNAI), vol. 3960, pp. 41–49. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  3. Nagata, M., Saito, T., Suzuki, K.: Using the web as a bilingual dictionary. In: Proc. ACL 2001 Workshop Data-Driven Methods in Machine Translation, pp. 95–102 (2001)

    Google Scholar 

  4. Cheng, P.J., Teng, J.W., Chen, R.C., Wang, J.H., Lu, W.H., Chien, L.F.: Translating un-known queries with web corpora for cross-language information retrieval. In: The Proceedings of 27th ACM SIGIR, pp. 146–153. ACM Press (2004)

    Google Scholar 

  5. Lu, C.Y., Xu,Y., Geva, S.: Web-Based Query Translation fro English-Chinese CLIR. Computational Linguistics and Chinese Language Processing 13(1), 61–90 (2008)

    Google Scholar 

  6. Huang, F., Zhang, Y., Vogel, S.: Mining Key Phrase Transla-tion from Web Corpora. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp. 483–490 (2005)

    Google Scholar 

  7. Fang, G.L., Yu, H., Nishino, F.: Chinese-English Term Translation Mining Based on Semantic Prediction. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Session, pp. 199–206 (2006)

    Google Scholar 

  8. Sun, J., Yao, J.M., Zhang, J., Zhu, Q.M.: Web Mining of OOV translations. Journal of Information & Computational Science 5(1), 1–6 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liang, YH., Li, JX., Ye, L., Chen, K., Guo, CZ. (2012). The English Unknown Term Translation Mining with Improved Bilingual Snippets Collection Strategy. In: Huang, DS., Ma, J., Jo, KH., Gromiha, M.M. (eds) Intelligent Computing Theories and Applications. ICIC 2012. Lecture Notes in Computer Science(), vol 7390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31576-3_52

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31576-3_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31575-6

  • Online ISBN: 978-3-642-31576-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics