This chapter presents a toponym disambiguation approach based on supervised machine learning. The proposed approach uses a simple hierarchical geographic relationship model to describe geographic entities and geographic relationships among them. The disambiguation procedure begins with the identification of toponyms in documents by applying and extending the state-of-the-art named entity recognition technologies and then performs disambiguation as a supervised classification processes over a feature space of geographic relationships. A geographic knowledge base is modeled and constructed to support the whole disambiguation procedure. System performance is evaluated on a document collection consisting of 15,194 local Australian news articles. The experiment results show that the disambiguation accuracy ranges from 73.55 to 85.38 percent depending on the running parameters and the learning strategies used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Hu, YH., Ge, L. (2007). A Supervised Machine Learning Approach to Toponym Disambiguation. In: Scharl, A., Tochtermann, K. (eds) The Geospatial Web. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84628-827-2_11
Download citation
DOI: https://doi.org/10.1007/978-1-84628-827-2_11
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-84628-826-5
Online ISBN: 978-1-84628-827-2
eBook Packages: Computer ScienceComputer Science (R0)