Skip to main content

Using Recurrent Neural Networks for Toponym Resolution in Text

  • Conference paper
  • First Online:
Progress in Artificial Intelligence (EPIA 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11805))

Included in the following conference series:

Abstract

Toponym resolution refers to the disambiguation of place names and other references to places present in textual documents, resolving them to unambiguous geographical identifiers (e.g., geographic coordinates of latitude and longitude). One of the major challenges in this task is that, usually, place names are highly ambiguous (e.g., there are several locations on the surface of the Earth that share the same name). In this paper, we propose to address the task through a recurrent neural network architecture with multiple inputs and outputs, specifically leveraging pre-trained contextual embeddings (ELMo) and bi-directional Long Short-Term Memory (LSTM) units, both commonly used for textual data modeling. The proposed model was tested on two datasets that were previously used to evaluate toponym resolution systems, namely the War of the Rebellion and the Local-Global Lexicon corpora. The obtained results outperform state-of-the-art results, confirming the superiority of the proposed method over other previous approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://pypi.org/project/healpy/.

  2. 2.

    http://github.com/utcompling/WarOfTheRebellion.

  3. 3.

    http://github.com/milangritta/Pragmatic-Guide-to-Geoparsing-Evaluation/blob/master/data/Corpora/lgl.xml.

References

  1. Adams, B., McKenzie, G.: Crowdsourcing the character of a place: character-level convolutional networks for multilingual geographic text classification. Trans. GIS 22(2), 394–408 (2018)

    Article  Google Scholar 

  2. Ardanuy, M.C., Sporleder, C.: Toponym disambiguation in historical documents using semantic and geographic features. In: Proceedings of the International Conference on Digital Access to Textual Cultural Heritage (2017)

    Google Scholar 

  3. Berman, M.L., Mostern, R., Southall, H.: Placing Names: Enriching and Integrating Gazetteers. Indiana University Press (2016)

    Google Scholar 

  4. DeLozier, G., Baldridge, J., London, L.: Gazetteer-independent toponym resolution using geographic word profiles. In: Proceedings of the AAAI Conference on Artificial Intelligence (2015)

    Google Scholar 

  5. DeLozier, G., Wing, B.P., Baldridge, J., Nesbit, S.: Creating a novel geolocation corpus from historical texts. In: Proceedings of the Linguistic Annotation Workshop Held in Conjunction with the ACL Conference (2016)

    Google Scholar 

  6. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. CoRR abs/1810.04805 (2018)

    Google Scholar 

  7. Eger, S., Youssef, P., Gurevych, I.: Is it time to swish? Comparing deep learning activation functions across NLP tasks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2018)

    Google Scholar 

  8. Freire, N., Borbinha, J.L., Calado, P., Martins, B.: A metadata geoparsing system for place name recognition and resolution in metadata records. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (2011)

    Google Scholar 

  9. Goldberg, Y.: A primer on neural network models for natural language processing. J. Artif. Intell. Res. 57(1), 345–420 (2016)

    Article  MathSciNet  Google Scholar 

  10. Górski, K., et al.: HEALPix: a framework for high-resolution discretization and fast analysis of data distributed on the sphere. Astrophys. J. 622(2), 759 (2005)

    Article  Google Scholar 

  11. Gritta, M., Pilehvar, M.T., Collier, N.: Which Melbourne? Augmenting geocoding with maps. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (2018)

    Google Scholar 

  12. Gritta, M., Pilehvar, M.T., Limsopatham, N., Collier, N.: What’s missing in geographical parsing? Lang. Resour. Eval. 52(2), 603–623 (2018)

    Article  Google Scholar 

  13. Karimzadeh, M., Pezanowski, S., MacEachren, A.M., Wallgrün, J.O.: GeoTxt: a scalable geoparsing system for unstructured text geolocation. Trans. GIS 23(1), 118–136 (2019)

    Article  Google Scholar 

  14. Leidner, J.L.: Toponym resolution in text: annotation, evaluation and applications of spatial grounding. In: Special Interest Group on Information Retrieval Forum, vol. 41, no. 2 (2007)

    Article  Google Scholar 

  15. Lieberman, M.D., Samet, H.: Adaptive context features for toponym resolution in streaming news. In: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval (2012)

    Google Scholar 

  16. Lieberman, M.D., Samet, H., Sankaranarayanan, J.: Geotagging with local lexicons to build indexes for textually-specified spatial data. In: Proceedings of the IEEE International Conference on Data Engineering (2010)

    Google Scholar 

  17. Manguinhas, H., Martins, B., Borbinha, J.L., Siabato, W.: The DIGMAP geo-temporal web gazetteer service. E-Perimetron 4(1), 9–24 (2009)

    Google Scholar 

  18. Melo, F., Martins, B.: Automated geocoding of textual documents: a survey of current approaches. Trans. GIS 21(1), 3–8 (2017)

    Article  Google Scholar 

  19. Monteiro, B.R., Davis, C.A., Fonseca, F.T.: A survey on the geographic scope of textual documents. Comput. Geosci. 96, 23–34 (2016)

    Article  Google Scholar 

  20. Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2018)

    Google Scholar 

  21. Santos, J., Anastácio, I., Martins, B.: Using machine learning methods for disambiguating place references in textual documents. GeoJournal 80(3), 375–392 (2015)

    Article  Google Scholar 

  22. Smith, L.N.: Cyclical learning rates for training neural networks. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision (2017)

    Google Scholar 

  23. Vincenty, T.: Direct and inverse solutions of geodesics on the ellipsoid with application of nested equations. Surv. Rev. 23, 88–93 (1975)

    Article  Google Scholar 

  24. Wing, B.P.: Text-based document geolocation and its application to the digital humanities. Ph.D. thesis, The University of Texas at Austin (2015)

    Google Scholar 

  25. Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R., Le, Q.V.: XLNet: generalized autoregressive pretraining for language understanding (2019)

    Google Scholar 

Download references

Acknowledgments

This research was supported through Fundação para a Ciência e Tecnologia (FCT), through the project grants with references PTDC/EEI-SCR/1743/2014 (Saturn), T-AP HJ-253525 (DigCH), and PTDC/CCI-CIF/32607/2017 (MIMU), as well as through the INESC-ID multi-annual funding from the PIDDAC programme (UID/CEC/50021/2019). We also gratefully acknowledge the support of NVIDIA Corporation, with the donation of two Titan Xp GPUs used in the experiments reported on the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ana Bárbara Cardoso .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cardoso, A.B., Martins, B., Estima, J. (2019). Using Recurrent Neural Networks for Toponym Resolution in Text. In: Moura Oliveira, P., Novais, P., Reis, L. (eds) Progress in Artificial Intelligence. EPIA 2019. Lecture Notes in Computer Science(), vol 11805. Springer, Cham. https://doi.org/10.1007/978-3-030-30244-3_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30244-3_63

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30243-6

  • Online ISBN: 978-3-030-30244-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics