research-article

Construction of a Japanese gazetteers for Japanese local toponym disambiguation

Authors:
Masaharu Yoshioka

Hokkaido University, Hokkaido, Japan

Hokkaido University, Hokkaido, Japan
View Profile

,
Takahiro Fujiwara

Hokkaido University, Hokkaido, Japan

Hokkaido University, Hokkaido, Japan
View Profile

GIR '13: Proceedings of the 7th Workshop on Geographic Information RetrievalNovember 2013Pages 57–63https://doi.org/10.1145/2533888.2533930

Published:05 November 2013Publication History

GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval

Pages 57–63

ABSTRACT

When processing toponym information in natural language text, it is crucial to have a good gazetteers. There are several well-organized gazetteers for English text, but they do not cover Japanese local toponyms. In this paper, we introduce a Japanese gazetteers based on Open Data (e.g., the Toponym database distributed by Japanese ministries, Wikipedia, and GeoNames) and propose a toponym disambiguation framework that uses the constructed gazetteers. We also evaluate our approach based on a blog corpus that contains place names with high ambiguity.

References

D. Buscaldi. Approaches to disambiguating toponyms. SIGSPATIAL Special, 3(2):16--19, July 2011. Google ScholarDigital Library
M. Conti, S. K. Das, C. Bisdikian, M. Kumar, L. M. Ni, A. Passarella, G. Roussos, G. Troster, G. Tsudik, and F. Zambonelli. Looking ahead in pervasive computing: Challenges and opportunities in the era of cyber-physical convergence. Pervasive and Mobile Computing, 8(1):2--21, 2012. Google ScholarDigital Library
J. Gelernter and S. Balaji. An algorithm for local geoparsing of microtext. GeoInformatica, pages 1--33, 2013. Google ScholarDigital Library
F. Giunchiglia, V. Maltese, F. Farazi, and B. Dutta. Geowordnet: A resource for geo-spatial applications. In L. Aroyo, G. Antoniou, E. Hyvonen, A. ten Teije, H. Stuckenschmidt, L. Cabral, and T. Tudorache, editors, The Semantic Web: Research and Applications, volume 6088 of Lecture Notes in Computer Science, pages 121--136. Springer Berlin/Heidelberg, 2010. Google ScholarDigital Library
J. Hoffart, F. M. Suchanek, K. Berberich, and G. Weikum. Yago2: A spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence, 194(0):28--61, 2013. <ce:title>Artificial Intelligence, Wikipedia and Semi-Structured Resources</ce:title>. Google ScholarDigital Library
J. Kazama and K. Torisawa. Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In ACL, pages 407--415, 2008.Google Scholar
T. Kudo and Y. Matsumoto. Japanese dependency analysis using cascaded chunking. In CoNLL 2002: Proceedings of the 6th Conference on Natural Language Learning 2002 (COLING 2002 Post-Conference Workshops), pages 63--69, 2002. Google ScholarDigital Library
A. Popescu, G. Grefenstette, and H. Bouamor. Mining a multilingual geographical gazetteer from the web. In Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '09, pages 58--65, Washington, DC, USA, 2009. IEEE Computer Society. Google ScholarDigital Library
E. Rauch, M. Bukatin, and K. Baker. A confidence-based framework for disambiguating geographic terms. In Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1, HLT-NAACL-GEOREF '03, pages 50--54, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics. Google ScholarDigital Library
R. Volz, J. Kleb, and W. Mueller. Towards ontology-based disambiguation of geographical identifiers. In WWW2007, 2007.Google Scholar
X. Wang, Y. Zhang, M. Chen, X. Lin, H. Yu, and Y. Liu. An evidence-based approach for toponym disambiguation. In Geoinformatics, 2010, pages 1--7, 2010.Google ScholarCross Ref
M. Yoshioka and N. Kando. Issues for linking geographical open data of geonames and wikipedia. In H. Takeda, Y. Qu, R. Mizoguchi, and Y. Kitamura, editors, Semantic Technology, volume 7774 of Lecture Notes in Computer Science, pages 375--381. Springer Berlin Heidelberg, 2013.Google Scholar

Index Terms

Construction of a Japanese gazetteers for Japanese local toponym disambiguation
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Language resources
2. Information systems
  1. Information systems applications

Recommendations

A disambiguation method for Japanese compound verbs
MWE '03: Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18

The purpose of this study is to construct a semantic analysis method for disambiguating Japanese compound verbs. Japanese speakers produce a rich variety of compound verbs, making it difficult to process them by computer. We construct a method employing ...
Read More
Wikipedia Mining for Huge Scale Japanese Association Thesaurus Construction
AINAW '08: Proceedings of the 22nd International Conference on Advanced Information Networking and Applications - Workshops

Wikipedia, a huge scale Web-based dictionary, is an impressive corpus for knowledge extraction. We already proved that Wikipedia can be used for constructing an English association thesaurus and our link structure mining method is significantly ...
Read More
Construction and analysis of Japanese-English broadcast news corpus with named entity tags
MultiNER '03: Proceedings of the ACL 2003 workshop on Multilingual and mixed-language named entity recognition - Volume 15

We are aiming to acquire named entity (NE) translation knowledge from nonparallel, content-aligned corpora, by utilizing NE extraction techniques. For this research, we are constructing a Japanese-English broadcast news corpus with NE tags. The tags ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval
November 2013
92 pages
ISBN:9781450322416
DOI:10.1145/2533888
Editors:
Chris Jones
Cardiff University
,
Ross Purves
University of Zurich
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 November 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate46of61submissions,75%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 104
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Construction of a Japanese gazetteers for Japanese local toponym disambiguation

GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

A disambiguation method for Japanese compound verbs

Wikipedia Mining for Huge Scale Japanese Association Thesaurus Construction

Construction and analysis of Japanese-English broadcast news corpus with named entity tags

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Construction of a Japanese gazetteers for Japanese local toponym disambiguation

GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

A disambiguation method for Japanese compound verbs

Wikipedia Mining for Huge Scale Japanese Association Thesaurus Construction

Construction and analysis of Japanese-English broadcast news corpus with named entity tags

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media