Abstract
In this work we attempted to determine the relative importance of the geographical and WordNet-extracted terms with respect to the remainder of the query. In our system, geographical terms are expanded with WordNet holonyms and synonyms and indexed separately. We checked the relative importance of the terms by multiplying their weight by 0.75, 0.5 and 0.25. The comparison to the baseline system, which uses only Lucene, shows that in some cases it is possible to improve the mean average precision by balancing the relative importance of geographical terms with respect to the content words in the query. We also observed that WordNet holonyms may help in improving the recall but WordNet has a small coverage and term expansion is sensible to ambiguous place names.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Miller, G.A.: Wordnet: A lexical database for english. Communications of the ACM 38, 39–41 (1995)
Buscaldi, D., Rosso, P., Sanchis, E.: Using the WordNet Ontology in the GeoCLEF Geographical Information Retrieval Task. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 939–946. Springer, Heidelberg (2006)
Gey, F.C., Larson, R., Sanderson, M., Joho, H., Clough, P.: GeoCLEF: the CLEF 2005 Cross-Language Geographic Information Retrieval Track. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 908–919. Springer, Heidelberg (2006)
Buscaldi, D., Rosso, P., Sanchis, E.: A WordNet-based Indexing Technique for Geographical Information Retrieval. In: Peters, C., Gey, F.C., Gonzalo, J., Mller, H., Jones, G.J., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 954–957. Springer, Heidelberg (2007)
Garbin, E., Mani, I.: Disambiguating toponyms in news. In: Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT 2005), Morristown, NJ, USA, pp. 363–370. Association for Computational Linguistics (2005)
Overell, S., Rüger, S.: Geographic co-occurrence as a tool for GIR. In: GIR 2007: Proceedings of the 4th ACM workshop on Geographical information retrieval, pp. 71–76. ACM, New York (2007)
Buscaldi, D., Rosso, P.: A conceptual density-based approach for the disambiguation of toponyms. International Journal of Geographical Information Systems (accepted, to be published, 2008)
Leveling, J., Veiel, D.: Experiments on the Exclusion of Metonymic Location Names from GIR. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 901–904. Springer, Heidelberg (2007)
Cardoso, N., Silva, M.J.: Query expansion through geographical feature types. In: GIR 2007: Proceedings of the 4th ACM workshop on Geographical information retrieval, pp. 55–60. ACM, New York (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Buscaldi, D., Rosso, P. (2008). On the Relative Importance of Toponyms in GeoCLEF. In: Peters, C., et al. Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85760-0_103
Download citation
DOI: https://doi.org/10.1007/978-3-540-85760-0_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85759-4
Online ISBN: 978-3-540-85760-0
eBook Packages: Computer ScienceComputer Science (R0)