Skip to main content

Retrieval Effectiveness of Cross Language Information Retrieval Search Engines

  • Conference paper
Digital Libraries: For Cultural Heritage, Knowledge Dissemination, and Future Creation (ICADL 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7008))

Included in the following conference series:

Abstract

This study evaluates the retrieval effectiveness of English-Chinese (EC) cross-language information retrieval (CLIR) on four common search engines along the dimensions of recall and precision. We formulated a set of simple and complex queries on different topics including queries with translation ambiguity. Three independent bilingual proficient evaluators reviewed a total of 960 returned web pages each to assess document relevance. Findings showed that CLIR effectiveness is poor with average recall and precision values of 0.165 and 0.539 for monolingual EE/CC searches, and 0.078 and 0.282 for cross lingual CE/EC searches. Google outperformed Yahoo! in the experiments, and EC and EE searches returned better results than CE and CC results respectively. As this is the first set CLIR retrieval effectiveness measurements reported in literature, these findings can serve as a benchmark and provide a better understanding of the current CLIR capabilities of Web search engines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hansen, P., Petrelli, D., Beaulieu, M., Sanderson, M.: User-centred interface design for cross-language information retrieval. In: 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 383–384. IEEE Press, New York (2002)

    Google Scholar 

  2. Kralisch, A., Mandl, T.: Barriers to information access across languages on the internet: network and language effects. In: 39th Annual Hawaii International Conference on Systems Sciences, vol. 3, pp. 4–7 (2006)

    Google Scholar 

  3. Clarke, J.S., Willett, P.: Estimating the recall performance of web search engines. Aslib Proceedings 49(7), 184–189 (1997)

    Article  Google Scholar 

  4. Youssef, M.: Cross language information retrieval: Universal usability in practice. Department of Computer Science, University of Maryland (2001), http://otal.umd.edu/uupractice/clir/#r11 (retrieved May 30, 2011)

  5. Kumar, B.T., Prakash, J.N.: Precision and relative recall of search engines: A comparative study of Google and Yahoo. Singapore Journal of Library & Information Management 38, 124–137 (2009)

    Google Scholar 

  6. Zhang, J., Lin, S.Y.: Multiple language supports in search engines. Online Information Review 31(4), 516–532 (2007)

    Article  Google Scholar 

  7. Ogden, W.C., Cowie, J., Davis, M., Ludovik, E., Nirenburg, S., Molina-Salgado, H., et al.: Keizai: An interactive cross-language text retrieval system. In: Ananiadou, S., Hayashi, Y., Jacquemin, C., Leong, M.K., Chen, H.H. (eds.) Workshop on Machine Translation for Cross Language Information (1999), http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.37.5775&rep=rep1&type=pdf (retrieved May 30, 2011)

  8. Ogden, W.C., Davis, M.W.: Improving cross-language text retrieval with human interactions. In: 33rd Annual Hawaii International Conference on System Sciences, vol. (3), p. 3044 (2000)

    Google Scholar 

  9. Capstick, J., Diagne, A.K., Erbach, G., Uszkoreit, H., Leisenberg, A., Leisenberg, M.: A system for supporting cross-lingual information retrieval. Information Processing & Management 36(2), 275–289 (2000)

    Article  Google Scholar 

  10. Peñas, A., Gonzalo, J., Verdejo, F., Lenguajes, D., Informáticos, S.: Cross-language information access through phrase browsing. In: Applications of Natural Language to Information Systems. LNI, pp. 121–130 (2001)

    Google Scholar 

  11. Airio, E.: Who benefits from CLIR in web retrieval. Journal of Documentation 64(5), 760–778 (2007)

    Article  Google Scholar 

  12. Gey, F., Kando, N., Peters, C.: Cross-language information retrieval: the way ahead. Information Processing and Management 41, 415–431 (2005)

    Article  Google Scholar 

  13. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval, 2nd edn. Addison-Wesley, Reading (2010)

    Google Scholar 

  14. Manning, C.D., Raghavan, P., Schutze, H.: Introduction to information retrieval. Cambridge University Press, Cambridge (2008)

    Book  MATH  Google Scholar 

  15. Foo, S., Li, H.: Chinese word segmentation and its effect on information retrieval. Information Processing and Management 40(1), 161–190 (2004)

    Article  Google Scholar 

  16. Shafi, S., Rather, R.: Precision and recall of five search engines for retrieval of scholarly information in the field of biotechnology. Webology 2(2), Article 15 (2005), http://www.webology.org/2005/v2n2/a12.html (retrieved May 30, 2011)

  17. Croft, B., Metzler, D., Strohman, T.: Search engines: Information retrieval in practice. Addison Wesley, Reading (2009)

    Google Scholar 

  18. Chinchor, N., Dungca, G.: Four scores and seven years ago: The scoring method for MUC-6. In: Proc. MUC-6 Conference, Columbia, MD (1995)

    Google Scholar 

  19. Makhoul, J., Kubala, F., Schwartz, R., Weischedel, R.: Performance measures for information extractors. In: Proc. DARPA Broadcast News Workshop, Herndon, VA, USA (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Foo, S. (2011). Retrieval Effectiveness of Cross Language Information Retrieval Search Engines. In: Xing, C., Crestani, F., Rauber, A. (eds) Digital Libraries: For Cultural Heritage, Knowledge Dissemination, and Future Creation. ICADL 2011. Lecture Notes in Computer Science, vol 7008. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24826-9_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24826-9_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24825-2

  • Online ISBN: 978-3-642-24826-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics