Abstract
This paper describes the official runs of the Twenty-One group for the first CLEF workshop. The Twenty-One group participated in the monolingual, bilingual and multilingual tasks. The following new techniques are introduced in this paper. In the bilingual task we experimented with different methods to estimate translation probabilities. In the multilingual task we experimented with refinements on raw-score merging techniques and with a new relevance feedback algorithm that re-estimates both the model’s translation probabilities and the relevance weights. Finally, we performed preliminary experiments to exploit the web to generate translation probabilities and bilingual dictionaries, notably for English-Italian and English-Dutch.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
L. Ballesteros and W. B. Croft. Resolving ambiguity for cross-language retrieval. In Proceedings of the 21st ACM SIGIR Conference Research and Development in Information Retrieval (SIGIR’98), pages 64–71, 1998.
D. Hiemstra and F.M.G. de Jong. Disambiguation strategies for cross-language information retrieval. In Proceedings of the third European Conference on Research and Advanced Technology for Digital Libraries (ECDL’99), pages 274–293, 1999.
D. Hiemstra and W. Kraaij. Twenty-One at TREC-7: Ad-hoc and cross-language track. In Proceedings of the seventh Text Retrieval Conference TREC-7, NIST Special Publication 500-242, pages 227–238, 1999.
D. Hiemstra. A probabilistic justification for using tf.idf term weighting in information retrieval. International Journal on Digital Libraries, 3(2):131–139, 2000.
W. Kraaij, R. Pohlmann, and D. Hiemstra. Twenty-one at TREC-8: using language technology for information retrieval. In Proceedings of the eighth Text Retrieval Conference TREC-8, NIST Special Publication 500-246, pages 285–300, 2000.
W. Kraaij and R. Pohlmann. Viewing stemming as recall enhancement. In H.P. Frei, D. Harman, P. Schäuble, and R. Wilkinson, editors, Proceedings of the 19th ACMSIGIR Conference on Research and Development in Information Retrieval (SIGIR’ 96), pages 40–48, 1996.
J.Y. Nie. Using parallel web pages for multilingual information retrieval. In Proceedings of CLEF 2000, 2000 (elsewhere in this volume).
J.Y. Nie, M. Simard, P. Isabelle, and R. Durand. Cross-language information retrieval based on parallel texts and automatic mining of parallel texts in the web. In ACM-SIGIR’99, pages 74–81, 1999.
R. Pohlmann and W. Kraaij. The effect of syntactic phrase indexing on retrieval performance for Dutch texts. In L. Devroye and C. Chrisment, editors, Proceedings of RIAO’97, pages 176–187, 1997.
A. Pirkola. The effects of query structure and dictionary setups in dictionarybased cross-language information retrieval. In 21st ACM Conference on Research and Development in Information Retrieval (SIGIR), pages 55–63, 1998.
S.E. Robertson and K. Sparck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129–146, 1976.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hiemstra, D., Kraaij, W., Pohlmann, R., Westerveld, T. (2001). Translation Resources, Merging Strategies, and Relevance Feedback for Cross-Language Information Retrieval. In: Peters, C. (eds) Cross-Language Information Retrieval and Evaluation. CLEF 2000. Lecture Notes in Computer Science, vol 2069. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44645-1_10
Download citation
DOI: https://doi.org/10.1007/3-540-44645-1_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42446-8
Online ISBN: 978-3-540-44645-3
eBook Packages: Springer Book Archive