skip to main content
10.1145/2533888.2533930acmconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
research-article

Construction of a Japanese gazetteers for Japanese local toponym disambiguation

Published:05 November 2013Publication History

ABSTRACT

When processing toponym information in natural language text, it is crucial to have a good gazetteers. There are several well-organized gazetteers for English text, but they do not cover Japanese local toponyms. In this paper, we introduce a Japanese gazetteers based on Open Data (e.g., the Toponym database distributed by Japanese ministries, Wikipedia, and GeoNames) and propose a toponym disambiguation framework that uses the constructed gazetteers. We also evaluate our approach based on a blog corpus that contains place names with high ambiguity.

References

  1. D. Buscaldi. Approaches to disambiguating toponyms. SIGSPATIAL Special, 3(2):16--19, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Conti, S. K. Das, C. Bisdikian, M. Kumar, L. M. Ni, A. Passarella, G. Roussos, G. Troster, G. Tsudik, and F. Zambonelli. Looking ahead in pervasive computing: Challenges and opportunities in the era of cyber-physical convergence. Pervasive and Mobile Computing, 8(1):2--21, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Gelernter and S. Balaji. An algorithm for local geoparsing of microtext. GeoInformatica, pages 1--33, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Giunchiglia, V. Maltese, F. Farazi, and B. Dutta. Geowordnet: A resource for geo-spatial applications. In L. Aroyo, G. Antoniou, E. Hyvonen, A. ten Teije, H. Stuckenschmidt, L. Cabral, and T. Tudorache, editors, The Semantic Web: Research and Applications, volume 6088 of Lecture Notes in Computer Science, pages 121--136. Springer Berlin/Heidelberg, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Hoffart, F. M. Suchanek, K. Berberich, and G. Weikum. Yago2: A spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence, 194(0):28--61, 2013. <ce:title>Artificial Intelligence, Wikipedia and Semi-Structured Resources</ce:title>. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Kazama and K. Torisawa. Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In ACL, pages 407--415, 2008.Google ScholarGoogle Scholar
  7. T. Kudo and Y. Matsumoto. Japanese dependency analysis using cascaded chunking. In CoNLL 2002: Proceedings of the 6th Conference on Natural Language Learning 2002 (COLING 2002 Post-Conference Workshops), pages 63--69, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Popescu, G. Grefenstette, and H. Bouamor. Mining a multilingual geographical gazetteer from the web. In Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '09, pages 58--65, Washington, DC, USA, 2009. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. E. Rauch, M. Bukatin, and K. Baker. A confidence-based framework for disambiguating geographic terms. In Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1, HLT-NAACL-GEOREF '03, pages 50--54, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R. Volz, J. Kleb, and W. Mueller. Towards ontology-based disambiguation of geographical identifiers. In WWW2007, 2007.Google ScholarGoogle Scholar
  11. X. Wang, Y. Zhang, M. Chen, X. Lin, H. Yu, and Y. Liu. An evidence-based approach for toponym disambiguation. In Geoinformatics, 2010, pages 1--7, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  12. M. Yoshioka and N. Kando. Issues for linking geographical open data of geonames and wikipedia. In H. Takeda, Y. Qu, R. Mizoguchi, and Y. Kitamura, editors, Semantic Technology, volume 7774 of Lecture Notes in Computer Science, pages 375--381. Springer Berlin Heidelberg, 2013.Google ScholarGoogle Scholar

Index Terms

  1. Construction of a Japanese gazetteers for Japanese local toponym disambiguation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval
        November 2013
        92 pages
        ISBN:9781450322416
        DOI:10.1145/2533888

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 5 November 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate46of61submissions,75%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader