Skip to main content

Disambiguating Geographic Names in a Historical Digital Library

  • Conference paper
  • First Online:
Research and Advanced Technology for Digital Libraries (ECDL 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2163))

Included in the following conference series:

Abstract

Geographic interfaces provide natural, scalable visualizations for many digital library collections, but the wide range of data in digital libraries presents some particular problems for identifying and disambiguating place names. We describe the toponym-disambiguation system in the Perseus digital library and evaluate its performance. Name categorization varies significantly among different types of documents, but toponym disambiguation performs at a high level of precision and recall with a gazetteer an order of magnitude larger than most other applications.

This research was supported by a grant from the Digital Libraries Initiative, Phase 2, with primary funding from the National Science Foundation and National Endowment for the Humanities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Association for Computational Linguistics. Proceedings of the Fifth Conference on Applied Natural Language Processing, Washington, DC, April 1997.

    Google Scholar 

  2. Daniel M. Bikel, Scott Miller, Richard Schwartz, and Ralph Weischedel. Nymble: A high-performance learning name-finder. In Proceedings of the Fifth Conference on Applied Natural Language Processing [1], pages 194–201.

    Google Scholar 

  3. David Day, John Aberdeen, Lynette Hirschman, Robyn Kozierok, Patricia Robinson, and Marc Vilain. Mixed-initiative development of language processing systems. In Proceedings of the Fifth Conference on Applied Natural Language Processing [1], pages 348–355.

    Google Scholar 

  4. Linda L. Hill, James Frew, and Qi Zheng. Geographic names: The implementation of a gazetteer in a georeferenced digital library. D-Lib Magazine, 5(1), January 1999. See http://www.dlib.org/dlib/january99/hill/01hill.html.

  5. Yasusi Kanada. A method of geographical name extraction from Japanese text for thematic geographical search. In Proceedings of the Eighth International Conference on Information and Knowledge Management, pages 46–54, Kansas City, Missouri, November 1999.

    Google Scholar 

  6. Ray R. Larson. Geographic information retrieval and spatial browsing. In Linda C. Smith and Myke Gluck, editors, Geographic Information Systems and Libraries: Patrons, Maps, and Spatial Information, pages 81–123, April 1995. See http://sherlock.berkeley.edu/geo_ir/PART1.html.

  7. David D. McDonald. Internal and external evidence in the identification and semantic categorization of proper names. In Branimir Boguraev and James Pustejovsky, editors, Corpus Processing for Lexical Acquisition, pages 21–39. MIT Press, Cambridge, MA, 1996.

    Google Scholar 

  8. Andreas M. Olligschlaeger and Alexander G. Hauptmann. Multimodal information systems and GIS: The Informedia digital video library. In Proceedings of the ESRI User Conference, San Diego, California, July 1999.

    Google Scholar 

  9. David A. Smith, Jeffrey A. Rydberg-Cox, and Gregory R. Crane. The Perseus Project: A digital library for the humanities. Literary and Linguistic Computing, 15(1):15–25, 2000.

    Article  Google Scholar 

  10. Nina Wacholder, Yael Ravin, and Misook Choi. Disambiguation of proper names in text. In Proceedings of the Fifth Conference on Applied Natural Language Processing [1], pages 202–208.

    Google Scholar 

  11. Allison G. Woodruff and Christian Plaunt. GIPSY: Automated geographic indexing of text documents. Journal of the American Society for Information Science, 45(9):645–655, 1994.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Smith, D.A., Crane, G. (2001). Disambiguating Geographic Names in a Historical Digital Library. In: Constantopoulos, P., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2001. Lecture Notes in Computer Science, vol 2163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44796-2_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-44796-2_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42537-3

  • Online ISBN: 978-3-540-44796-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics