Skip to main content

Estimating Translation Probabilities from the Web for Structured Queries on CLIR

  • Conference paper
  • 2151 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5993))

Abstract

We present two methods for estimating replacement probabilities without using parallel corpora. The first method proposed exploits the possible translation probabilities latent in Machine Readable Dictionaries (MRD). The second method is more robust, and exploits context similarity-based techniques in order to estimate word translation probabilities using the Internet as a bilingual comparable corpus. The experiments show a statistically significant improvement over non weighted structured queries in terms of MAP by using the replacement probabilities obtained with the proposed methods. The context similarity-based method is the one that yields the most significant improvement.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pirkola, A.: The Effects of Query Structure and Dictionary Setups in Dictionary-Based Cross-Language Information Retrieval. In: SIGIR 1998, pp. 55–63 (1998)

    Google Scholar 

  2. Ballesteros, L., Croft, W.B.: Resolving Ambiguity for Cross-Language Retrieval. In: SIGIR 1998, pp. 64–71 (1998)

    Google Scholar 

  3. Hiemstra, D., De Jong, F.: Statistical Language Models and Information Retrieval: natural language processing really meets retrieval. University of Twente (2001)

    Google Scholar 

  4. Darwish, K., Oard, D.W.: Probabilistic structured Query Methods. In: SIGIR 2003 (2003)

    Google Scholar 

  5. Fung, P., Yuen Yee, L.: An IR Approach for Translating New Words from Nonparallel, Comparable Texts. In: COLING-ACL (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Saralegi, X., Lopez de Lacalle, M. (2010). Estimating Translation Probabilities from the Web for Structured Queries on CLIR. In: Gurrin, C., et al. Advances in Information Retrieval. ECIR 2010. Lecture Notes in Computer Science, vol 5993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12275-0_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12275-0_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12274-3

  • Online ISBN: 978-3-642-12275-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics