Overview of WebCLEF 2006

Balog, Krisztian; Azzopardi, Leif; Kamps, Jaap; de Rijke, Maarten

doi:10.1007/978-3-540-74999-8_101

Krisztian Balog¹,
Leif Azzopardi³,
Jaap Kamps^1,2 &
…
Maarten de Rijke¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4730))

Included in the following conference series:

Workshop of the Cross-Language Evaluation Forum for European Languages

529 Accesses
9 Citations

Abstract

We report on the CLEF 2006 WebCLEF track devoted to crosslingual web retrieval. We provide details about the retrieval tasks, the used topic set, and the results of the participants. WebCLEF 2006 used a stream of known-item topics consisting of: (i) manual topics (including a selection of WebCLEF 2005 topics, and a set of new topics) and (ii) automatically generated topics (generated using two techniques). The results over all topics show that current CLIR systems are very effective, retrieving on average the target page in the top ranks. Manually constructed topics result in higher performance than and automatically generated ones. And finally, the resulting scores on automatic topics provide a reasonable ranking of the systems, showing that automatically generated topics are an attractive alternative in situations where manual topics are not readily available.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amitay, E., Carmel, D., Lempel, R., Soffer, A.: Scaling ir-system evaluation using term relevance sets. In: Proceedings of the 27th annual international ACM SIGIR conference on Research and Development in Information Retrieval, pp. 10–17. ACM Press, New York (2004)
Google Scholar
Azzopardi, L., de Rijke, M.: Automatic construction of known-item finding test beds. In: Proceedings of the 29th annual international ACM SIGIR conference on Research and Development in Information Retrieval, pp. 603–604. ACM Press, New York (2006)
Chapter Google Scholar
Broder, A.: A taxonomy of web search. SIGIR Forum 36(2), 3–10 (2002)
Article Google Scholar
Eurobarometer. Europeans and their languages. Special Eurobarometer 243, European Commision, URL (2006), http://ec.europa.eu/public_opinion/archives/ebs/ebs_243_en.pdf
Gey, F.C., Kando, N., Peters, C.: Cross language information retrieval: a research roadmap. SIGIR Forum 36(2), 72–80 (2002)
Article Google Scholar
Lemur: The Lemur toolkit for language modeling and information retrieval (2005), URL: http://www.lemurproject.org/
Miller, D.R., Leek, T., Schwartz, R.M.: A hidden Markov model information retrieval system. In: Proceedings of SIGIR 1999, 22nd ACM International Conference on Research and Development in Information Retrieval, Berkeley, US, pp. 214–221. ACM Press, New York (1999)
Chapter Google Scholar
Sigurbjörnsson, B., Kamps, J., de Rijke, M.: EuroGOV: Engineering a multilingual Web corpus. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)
Google Scholar
Sigurbjörnsson, B., Kamps, J., de Rijke, M.: Overview of WebCLEF 2005. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)
Google Scholar
Soboroff, I., Nicholas, C., Cahan, P.: Ranking retrieval systems without relevance judgments. In: SIGIR 2001: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 66–73. ACM Press, New York (2001)
Chapter Google Scholar
Voorhees, E.M.: Variations in relevance judgments and the measurement of retrieval effectiveness. In: SIGIR 1998: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, pp. 315–323. ACM Press, New York (1998)
Chapter Google Scholar
WebCLEF. Cross-lingual web retrieval (2006), URL http://ilps.science.uva.nl/WebCLEF/

Download references

Author information

Authors and Affiliations

ISLA, University of Amsterdam,
Krisztian Balog, Jaap Kamps & Maarten de Rijke
Archive and Information Studies, University of Amsterdam,
Jaap Kamps
Department of Computing Science, University of Glasgow,
Leif Azzopardi

Authors

Krisztian Balog
View author publications
You can also search for this author in PubMed Google Scholar
Leif Azzopardi
View author publications
You can also search for this author in PubMed Google Scholar
Jaap Kamps
View author publications
You can also search for this author in PubMed Google Scholar
Maarten de Rijke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Carol Peters Paul Clough Fredric C. Gey Jussi Karlgren Bernardo Magnini Douglas W. Oard Maarten de Rijke Maximilian Stempfhuber

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Balog, K., Azzopardi, L., Kamps, J., de Rijke, M. (2007). Overview of WebCLEF 2006. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_101

Download citation

DOI: https://doi.org/10.1007/978-3-540-74999-8_101
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics