skip to main content
10.1145/2533888.2533938acmconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
research-article

Assessment of the accuracy of GeoNames gazetteer data

Published: 05 November 2013 Publication History

Abstract

Gazetteers are the basis of many geospatial applications and serve an important role to collect and make available knowledge about the physical world such as place names and their coordinates. GeoNames is one of the largest and most often used gazetteer and it is generally assumed to be of sufficient quality. In this paper, we examine the quality and accuracy of the data in more detail, triggered by some anomalies encountered during its use. We present a classification of inaccuracies ranging from grid patterns, imprecise coordinates, overlaps and repetitions as well as misclassifications and visualize these for a range of countries. We finally give an outlook into potential corrections.

References

[1]
A. I. Abdelmoty and C. B. Jones. Towards maintaining consistency of spatial databases. In CIKM'97, 1997.
[2]
D. Ahlers. Towards Geospatial Search for Honduras. In LACNEM 2011, 2011.
[3]
D. Ahlers. Multi-source conflating index construction for local search in a low-coverage country. In LA-WEB 2012, 2012.
[4]
D. Ahlers. Lo mejor de dos idiomas -- Cross-lingual linkage of geotagged Wikipedia articles. In ECIR2013.
[5]
D. Ahlers and S. Boll. On the Accuracy of Online Geocoders. In Geoinformatik 2009, 2009.
[6]
R. Bakshi, C. A. Knoblock, and S. Thakkar. Exploiting Online Sources to Accurately Geocode Addresses. In GIS '04, 2004.
[7]
P. V. Bolstad and J. L. Smith. Errors in GIS: Assessing Spatial Data Accuracy. J. Forest., 90(11), 1992.
[8]
T. J. Brunner and R. S. Purves. Spatial auto-correlation and toponym ambiguity. GIR '08, 2008.
[9]
R. Devillers, A. Stein, Y. Bédard, N. Chrisman, P. Fisher, and W. Shi. Thirty years of research on spatial data quality: achievements, failures, and opportunities. Transactions in GIS, 14(4), 2010.
[10]
D. W. Goldberg, J. P. Wilson, and C. A. Knoblock. From Text to Geographic Coordinates: The Current State of Geocoding. URISA, 19(1), 2007.
[11]
C. Hauff. A Study on the Accuracy of Flickr's Geotag Data. SIGIR '13, 2013.
[12]
M. Helbich, C. Amelunxen, P. Neis, and A. Zipf. Comparative Spatial Analysis of Positional Accuracy of OpenStreetMap and Proprietary Geodata. Angewandte Geoinformatik, 2012.
[13]
L. L. Hill. Core Elements of Digital Gazetteers: Placenames, Categories, and Footprints. In ECDL '00, 2000.
[14]
T. J. Holmes and S. Lee. Cities as six-by-six-mile squares: Zipf's law? Agglomeration Economics, 2010.
[15]
H. Manguinhas, B. Martins, and J. L. Borbinha. A Geo-Temporal Web Gazetteer Integrating Data From Multiple Sources. In ICDIM, 2008.
[16]
D. Potere. Horizontal Positional Accuracy of Google Earth's High-Resolution Imagery Archive. Sensors, 8(12), 2008.
[17]
L. A. Souza, C. A. Davis, Jr., K. A. V. Borges, T. M. Delboni, and A. H. F. Laender. The Role of Gazetteers in Geographic Knowledge Discovery on the Web. In LA-WEB '05, 2005.
[18]
J. Wieczorek, Q. Guo, and R. Hijmans. The point-radius method for georeferencing locality descriptions and calculating associated uncertainty. Int. J. Geogr. Inf. Sci., 18(8), 2004.
[19]
J. Zhang and M. Goodchild. Uncertainty in Geographical Information. Taylor and Francis, 2002.

Cited By

View all

Index Terms

  1. Assessment of the accuracy of GeoNames gazetteer data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GIR '13: Proceedings of the 7th Workshop on Geographic Information Retrieval
    November 2013
    92 pages
    ISBN:9781450322416
    DOI:10.1145/2533888
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 November 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data quality
    2. entity reconciliation
    3. gazetteer
    4. geocoding
    5. positional accuracy
    6. source integration

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SIGSPATIAL'13
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 46 of 61 submissions, 75%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)47
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Developing practices for FAIR and linked data in Heritage Sciencenpj Heritage Science10.1038/s40494-025-01598-x13:1Online publication date: 1-Mar-2025
    • (2025)A survey on pragmatic processing techniquesInformation Fusion10.1016/j.inffus.2024.102712114(102712)Online publication date: Feb-2025
    • (2024)Representing Geospatial Knowledge in NarrativesJournal on Computing and Cultural Heritage 10.1145/370391818:1(1-15)Online publication date: 12-Nov-2024
    • (2024)On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)ACM Transactions on Spatial Algorithms and Systems10.1145/365307010:2(1-46)Online publication date: 1-Jul-2024
    • (2024)The Construction of a Mountain Vegetation Knowledge Graph Incorporating With Geographical Principles, Maps, and Remote Sensing ImagesIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.349345562(1-15)Online publication date: 2024
    • (2024)A survey on semantic processing techniquesInformation Fusion10.1016/j.inffus.2023.101988101(101988)Online publication date: Jan-2024
    • (2023)Farm Friendly Chat BotInternational Journal of Advanced Research in Science, Communication and Technology10.48175/IJARSCT-13160(435-439)Online publication date: 21-Oct-2023
    • (2023) EVKG : An interlinked and interoperable electric vehicle knowledge graph for smart transportation system Transactions in GIS10.1111/tgis.1306427:4(949-974)Online publication date: 23-May-2023
    • (2023)Estimating Bounding Box for Point of Interest Using Social Media Geo-Tagged PhotosIEEE Access10.1109/ACCESS.2023.323901411(7837-7849)Online publication date: 2023
    • (2023)ecolo-zip: A global, rich and granular characterization of biogeophysical ecology for 1.5 million postal codesScientific Data10.1038/s41597-023-02579-010:1Online publication date: 29-Sep-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media