DOI: 10.1145/1722080.1722099
research-article

Grounding toponyms in an Italian local news corpus

Published: 18 February 2010

ABSTRACT

In this paper we present a study of the toponyms contained in an Italian news collection, carried out to determine how ambiguous those toponyms are and how difficult such ambiguities are to resolve. The results show that frequent toponyms are usually less ambiguous than rare toponyms. The resolution of ambiguities on a sample of 1,042 toponyms with different features confirms that ambiguous toponyms are spatially autocorrelated.

    • Published in

      ACM Other conferences
      GIR '10: Proceedings of the 6th Workshop on Geographic Information Retrieval
      February 2010
      130 pages
      ISBN: 9781605588261
      DOI: 10.1145/1722080

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      Overall Acceptance Rate: 46 of 61 submissions, 75%
