Abstract
The search for features in topics and queries relevant for the performance in information retrieval is an important strategy for system optimization. Named entities in topics are a significant feature contributing to the quality of the retrieval results. In this contribution, we present an analysis on the correlation between the number of named entities present in a topic formulation and the final retrieval quality for these topics by retrieval systems within CLEF. The analysis includes the results of CLEF 2004. We found that a medium positive correlation exists for German, English and Spanish topics. Furthermore, the effect of the document or target language on the retrieval quality is also investigated.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Braschler, M., Peters, C.: Cross-Language Evaluation Forum: Objectives, Results, Achievements. Information Retrieval 2004 7, 7–31 (2004)
Voorhees, E., Buckland, L. (eds.): The Eleventh Text Retrieval Conference (TREC 2002). NIST Special Publication 500-251. National Institute of Standards and Technology, Gaithersburg, Maryland (November 2002), http://trec.nist.gov/pubs/trec11/t11_proceedings.html
Fuhr, N.: Initiative for the Evaluation of XML Retrieval (INEX): INEX 2003 Workshop Proceedings, Dagstuhl, Germany, December 15-17 (2003), http://purl.oclc.org/NET/duett-07012004-093151
Oyama, K., Ishida, E., Kando, N. (eds.): NTCIR Workshop3 Proceedings of the Third NTCIR Workshop on research in Information Retrieval, Automatic Text Summarization and Question Answering (September 2001-October 2002) (2003), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings3/index.html
Downie, S.: Toward the Scientific Evaluation of Music Information Retrieval Systems. In: Intl. Symposium on Music Information Retrieval. Washington, D.C., & Baltimore, USA (2003), http://ismir2003.ismir.net/papers/Downie.PDF
Schneider, R., Mandl, T., Womser-Hacker, C.: Workshop LECLIQ: Lessons Learned from Evaluation: Towards Integration and Transparency in Cross-Lingual Information Retrieval with a special Focus on Quality Gates. In: 4th International Conference on Language Resources and Evaluation (LREC), Lisbon, Portugal, May 24-30, pp. 1–4
Evans, D., Shanahan, J., Sheftel, V.: Topic Structure Modeling. In: Proc. of the Annual Intl. ACM Conference on Research and Development in Information Retrieval (SIGIR 2002) Tampere, Finland, pp. 417–418 (2002)
Cronen-Townsend, S., Zhou, Y., Croft, W.: Predicting Query Ambiguity. In: Proc. of the Annual Intl. ACM Conference on Research and Development in Information Retrieval (SIGIR 2002) Tampere, Finland, pp. 299–306 (2002)
Mandl, T., Womser-Hacker, C.: Analysis of Topic Features in Cross-Language Information Retrieval Evaluation. In: 4th International Conference on Language Resources and Evaluation (LREC) Workshop Lessons Learned from Evaluation: Towards Transparency and Integration in Cross-Lingual Information Retrieval (LECLIQ), Lisbon, Portugal, May 24-30, pp. 17–19
Mandl, T., Womser-Hacker, C.: A Framework for long-term Learning of Topical User Preferences in Information Retrieval. New Library World 105(5/6), 184–195 (2004)
Sekine, S., Sudo, K., Nobata, C.: Extended Named Entity Hierarchy. In: Proceedings of Third International Conference on Language Resources and Evaluation (LREC 2002), Las Palmas, Canary Islands, Spain (2002)
Voorhees, E., Buckley, C.: The Effect of Topic Set Size on Retrieval Experiment Error. In: Proc. of the Annual Intl. ACM Conference on Research and Development in Information Retrieval (SIGIR 2002), Tampere, Finland, pp. 316–323 (2002)
Hackl, R., Kölle, R., Mandl, T., Ploedt, A., Scheufen, J.-H., Womser-Hacker, C.: Multilingual Retrieval Experiments with MIMOR at the University of Hildesheim. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 166–173. Springer, Heidelberg (2004)
Womser-Hacker, C.: Multilingual Topic Generation within the CLEF 2001 Experiments. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 389–393. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mandl, T., Womser-Hacker, C. (2005). How Do Named Entities Contribute to Retrieval Effectiveness?. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_81
Download citation
DOI: https://doi.org/10.1007/11519645_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)