ABSTRACT
Toponyms in texts and search queries are often used figuratively and do not directly refer to the locations they reference in their literal sense. Different usage kinds and stylistic devices characterize toponym usages in texts. It is thus crucial for a Geographic Information Retrieval (GIR) system to precisely distinguish these different toponym usages at indexing and at query time in order to best address a given information need and the geospatial footprint of a document.
For that purpose, we analyze which of the classic stylistic devices such as allegories, metaphors, or metonymies are used together with toponyms. We use these categories as a foundation for a systematic approach towards the characterization of toponym usages in texts which we believe is necessary to further boost retrieval effectiveness of future GIR systems. A prototype implements this characterization exemplary for texts written in German. We evaluate the effectiveness of our approach against a reference corpus to show the general feasibility. Our approach provides a basis for a wide range of more sophisticated applications such as for example text genre detection.
- D. Bamman, B. O'Connor, and N. A. Smith. Learning Latent Personas of Film Characters. In Proc. of the 51st Annual Meeting of the Association for Computational Linguistics, pages 352--361, Sofia, Bulgaria, 2013. ACL.Google Scholar
- D. Buscaldi and P. Rosso. A conceptual density-based approach for the disambiguation of toponyms. Intl. Journal of Geographical Information Science, 22(3):301--313, Mar. 2008. Google ScholarDigital Library
- H. Cunningham and D. Maynard. GATE: an architecture for development of robust HLT applications. In Proc. of the 40th Annual Meeting of the Association for Computational Linguistics, pages 168--175, Philadelphia, PA, USA, 2002. ACL. Google ScholarDigital Library
- J. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by Gibbs sampling. In Proc. of the 43nd Annual Meeting of the Association for Computational Linguistics, pages 363--370, Ann Arbor, MI, USA, 2005. ACL. Google ScholarDigital Library
- J. Gawryjolek, C. Dimarco, and R. Harris. An Annotation Tool for Automatically Detecting Rhetorical Figures. In Proc. of the IJCAI-09 Workshop on Computational Models of Natural Argument, http://www.cmna.info/CMNA9/proceedings/CMNA9-Gawryjolek%20et%20al.pdf, last visit: 25.8.14, Pasadena, CA, USA, 2009.Google Scholar
- B. Hamp and H. Feldweg. GermaNet - A Lexical-Semantic Net for German. In Proc. of ACL Workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, pages 9--15, Madrid, Spain, 1997.Google Scholar
- A. Henrich, V. Lüdecke, and D. Blank. Approaches for determining the geographic footprint of arbitrary terms for retrieval and visualization. In Proceedings of the 16th ACM SIGSPATIAL Intl. Conf. on Advances in Geographic Information Systems, GIS '08, pages 1--4, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
- A. R. Kelly, N. A. Abbott, R. A. Harris, C. DiMarco, and D. R. Cheriton. Toward an ontology of rhetorical figures. In Proc. of the 28th Intl. Conf. on Design of Communication, pages 123--130, Sao Carlos, Sao Paulo, Brazil, 2010. ACM. Google ScholarDigital Library
- L. Kolmer and C. Rob-Santer. Textbook Rhetoric (in German). Verlag Ferdinand Schöningh, Paderborn, 2002.Google Scholar
- J. Leveling and S. Hartrumpf. On metonymy recognition for geographic information retrieval. Intl. Journal of Geographical Information Science, 22(3):289--299, Mar. 2008. Google ScholarDigital Library
- D. Marcu. The rhetorical parsing of natural language texts. In Proc. of the 35th Annual Meeting of the Association for Computational Linguistics, pages 96--103, Madrid, Spain, 1997. ACL. Google ScholarDigital Library
- K. Markert and U. Hahn. Understanding metonymies in discourse. Artificial Intelligence, 135(1-2):145--198, Feb. 2002. Google ScholarDigital Library
- K. Markert and M. Nissim. Data and models for metonymy resolution. Language Resources and Evaluation, 43(2):123--138, Feb. 2009.Google ScholarCross Ref
- V. Nastase, A. Judea, K. Markert, and M. Strube. Local and global context for supervised and unsupervised metonymy resolution. In Proc. of the Joint Conf. on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 183--193, Jeju Island, Korea, 2012. ACL. Google ScholarDigital Library
- H. P. Nii. Blackboard Systems. Technical report, Stanford University, CA, USA, CS-TR-86-1123, 1986. Google ScholarDigital Library
- M. Nissim and K. Markert. Syntactic features and word similarity for supervised metonymy resolution. In Proc. of the 41st Annual Meeting of the Association for Computational Linguistics, pages 56--63, Sapporo, Japan, 2003. ACL. Google ScholarDigital Library
- P. Perera and R. Witte. A Self-Learning Context-Aware Lemmatizer for German. In Proc. of the Conf. on Human Language Technology and Empirical Methods in Natural Language Processing, pages 636--643, Vancouver, BC, Canada, 2005. ACL. Google ScholarDigital Library
- H. Schmid. Probabilistic part-of-speech tagging using decision trees. In Proc. of the Intl. Conf. on New Methods in Language Processing, Manchester, UK, 1994.Google Scholar
- H. Schmid. Improvements in part-of-speech tagging with an application to German. In Proc. of the SIGDAT-Workshop, Dublin, Ireland, 1995. ACL.Google Scholar
- R. Sennrich, G. Schneider, M. Volk, and M. Warin. A new hybrid dependency parser for German. In Proc. of the Biannual Meeting of the German Society for Computational Linguistics and Language Technology, pages 115--124, Potsdam, Germany, 2009. GSCL.Google Scholar
Index Terms
- Characterization of toponym usages in texts
Recommendations
Toponym ambiguity in geographical information retrieval
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrievalThe objectives of this research work is to study the effects of toponym (place name) ambiguity in the Geographical Information Retrieval (GIR) task. Our experience with GIR systems shows that toponym ambiguity may be an important factor in the inability ...
A GIR architecture with semantic-flavored query reformulation
GIR '10: Proceedings of the 6th Workshop on Geographic Information RetrievalMost geographic queries include references to entities (geographic and non-geographic). Grounding such entities is essential to properly understand the user's information need. As statistical-based query reformulation strategies work at term level, not ...
Map-based filters for fuzzy entities in geographical information retrieval
NLDB'11: Proceedings of the 16th international conference on Natural language processing and information systemsMany users employ vague geographical expressions to query Information Retrieval systems. These fuzzy entities do not appear neither in gazetteers nor in geographical databases. Searches such as "Ski resorts in north-central Spain" or "Restaurants near ...
Comments