Abstract
This paper presents an approach of a cross-lingual information retrieval which uses a ranking method based on a penalisation version of the Jaccard formula. The obtained results after the submission of a set of runs to the WebCLEF 2006 have shown that this simple ranking formula may be used in a cross-lingual environment. A comparison with runs submitted by other teams ranks us in a third place by using all the topics. A fourth place is obtained with our best overall results by using only the new topic set, and a second place was got by using only the automatic topics of the new topic set. An exact comparison with the rest of the participants is in fact difficult to obtain and, therefore, we consider that further detailed analysis of the components should be done in order to determine the best components of the proposed system.
This work was partially supported by the MCyT TIN2006-15265-C06-04 project, as well as by the BUAP-701 PROMEP/103.5/05/1536 grant.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balog, K., Azzopardi, L., Kamps, J., de Rijke, M.: Overview of WebCLEF 2006, LNCS, vol. 4730, Springer, Heidelberg (2007)
Kraaij, W., Simard, M., Nie, J.Y.: Embedding Web-based Statistical Translation Models in Cross-Language Information Retrieval. Computational Linguistics 29(3), 381–419 (2003)
Pinto, D., Jiménez-Salazar, H., Rosso, P.: BUAP-UPV TPIRS: A System for Document Indexing Reduction on WebCLEF. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)
Salton, G.: Automatic Text Processing. Addison-Wesley, Reading (1989)
Sigurbjörnsson, B., Kamps, J., de Rijke, M.: EuroGOV: Engineering a Multilingual Web Corpus. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pinto, D., Rosso, P., Jiménez, E. (2007). A Penalisation-Based Ranking Approach for the Mixed Monolingual Task of WebCLEF 2006. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_103
Download citation
DOI: https://doi.org/10.1007/978-3-540-74999-8_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer ScienceComputer Science (R0)