Using Recurrent Neural Networks for Toponym Resolution in Text

Cardoso, Ana Bárbara; Martins, Bruno; Estima, Jacinto

doi:10.1007/978-3-030-30244-3_63

Ana Bárbara Cardoso¹¹,
Bruno Martins¹¹ &
Jacinto Estima¹²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11805))

Included in the following conference series:

EPIA Conference on Artificial Intelligence

1824 Accesses
2 Citations
2 Altmetric

Abstract

Toponym resolution refers to the disambiguation of place names and other references to places present in textual documents, resolving them to unambiguous geographical identifiers (e.g., geographic coordinates of latitude and longitude). One of the major challenges in this task is that, usually, place names are highly ambiguous (e.g., there are several locations on the surface of the Earth that share the same name). In this paper, we propose to address the task through a recurrent neural network architecture with multiple inputs and outputs, specifically leveraging pre-trained contextual embeddings (ELMo) and bi-directional Long Short-Term Memory (LSTM) units, both commonly used for textual data modeling. The proposed model was tested on two datasets that were previously used to evaluate toponym resolution systems, namely the War of the Rebellion and the Local-Global Lexicon corpora. The obtained results outperform state-of-the-art results, confirming the superiority of the proposed method over other previous approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Adams, B., McKenzie, G.: Crowdsourcing the character of a place: character-level convolutional networks for multilingual geographic text classification. Trans. GIS 22(2), 394–408 (2018)
Article Google Scholar
Ardanuy, M.C., Sporleder, C.: Toponym disambiguation in historical documents using semantic and geographic features. In: Proceedings of the International Conference on Digital Access to Textual Cultural Heritage (2017)
Google Scholar
Berman, M.L., Mostern, R., Southall, H.: Placing Names: Enriching and Integrating Gazetteers. Indiana University Press (2016)
Google Scholar
DeLozier, G., Baldridge, J., London, L.: Gazetteer-independent toponym resolution using geographic word profiles. In: Proceedings of the AAAI Conference on Artificial Intelligence (2015)
Google Scholar
DeLozier, G., Wing, B.P., Baldridge, J., Nesbit, S.: Creating a novel geolocation corpus from historical texts. In: Proceedings of the Linguistic Annotation Workshop Held in Conjunction with the ACL Conference (2016)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. CoRR abs/1810.04805 (2018)
Google Scholar
Eger, S., Youssef, P., Gurevych, I.: Is it time to swish? Comparing deep learning activation functions across NLP tasks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2018)
Google Scholar
Freire, N., Borbinha, J.L., Calado, P., Martins, B.: A metadata geoparsing system for place name recognition and resolution in metadata records. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (2011)
Google Scholar
Goldberg, Y.: A primer on neural network models for natural language processing. J. Artif. Intell. Res. 57(1), 345–420 (2016)
Article MathSciNet Google Scholar
Górski, K., et al.: HEALPix: a framework for high-resolution discretization and fast analysis of data distributed on the sphere. Astrophys. J. 622(2), 759 (2005)
Article Google Scholar
Gritta, M., Pilehvar, M.T., Collier, N.: Which Melbourne? Augmenting geocoding with maps. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (2018)
Google Scholar
Gritta, M., Pilehvar, M.T., Limsopatham, N., Collier, N.: What’s missing in geographical parsing? Lang. Resour. Eval. 52(2), 603–623 (2018)
Article Google Scholar
Karimzadeh, M., Pezanowski, S., MacEachren, A.M., Wallgrün, J.O.: GeoTxt: a scalable geoparsing system for unstructured text geolocation. Trans. GIS 23(1), 118–136 (2019)
Article Google Scholar
Leidner, J.L.: Toponym resolution in text: annotation, evaluation and applications of spatial grounding. In: Special Interest Group on Information Retrieval Forum, vol. 41, no. 2 (2007)
Article Google Scholar
Lieberman, M.D., Samet, H.: Adaptive context features for toponym resolution in streaming news. In: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval (2012)
Google Scholar
Lieberman, M.D., Samet, H., Sankaranarayanan, J.: Geotagging with local lexicons to build indexes for textually-specified spatial data. In: Proceedings of the IEEE International Conference on Data Engineering (2010)
Google Scholar
Manguinhas, H., Martins, B., Borbinha, J.L., Siabato, W.: The DIGMAP geo-temporal web gazetteer service. E-Perimetron 4(1), 9–24 (2009)
Google Scholar
Melo, F., Martins, B.: Automated geocoding of textual documents: a survey of current approaches. Trans. GIS 21(1), 3–8 (2017)
Article Google Scholar
Monteiro, B.R., Davis, C.A., Fonseca, F.T.: A survey on the geographic scope of textual documents. Comput. Geosci. 96, 23–34 (2016)
Article Google Scholar
Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2018)
Google Scholar
Santos, J., Anastácio, I., Martins, B.: Using machine learning methods for disambiguating place references in textual documents. GeoJournal 80(3), 375–392 (2015)
Article Google Scholar
Smith, L.N.: Cyclical learning rates for training neural networks. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision (2017)
Google Scholar
Vincenty, T.: Direct and inverse solutions of geodesics on the ellipsoid with application of nested equations. Surv. Rev. 23, 88–93 (1975)
Article Google Scholar
Wing, B.P.: Text-based document geolocation and its application to the digital humanities. Ph.D. thesis, The University of Texas at Austin (2015)
Google Scholar
Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R., Le, Q.V.: XLNet: generalized autoregressive pretraining for language understanding (2019)
Google Scholar

Download references

Acknowledgments

This research was supported through Fundação para a Ciência e Tecnologia (FCT), through the project grants with references PTDC/EEI-SCR/1743/2014 (Saturn), T-AP HJ-253525 (DigCH), and PTDC/CCI-CIF/32607/2017 (MIMU), as well as through the INESC-ID multi-annual funding from the PIDDAC programme (UID/CEC/50021/2019). We also gratefully acknowledge the support of NVIDIA Corporation, with the donation of two Titan Xp GPUs used in the experiments reported on the paper.

Author information

Authors and Affiliations

INESC-ID and Instituto Superior Técnico, Lisbon, Portugal
Ana Bárbara Cardoso & Bruno Martins
INESC-ID and Instituto Politécnico de Setúbal, Lisbon, Portugal
Jacinto Estima

Authors

Ana Bárbara Cardoso
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Martins
View author publications
You can also search for this author in PubMed Google Scholar
Jacinto Estima
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ana Bárbara Cardoso .

Editor information

Editors and Affiliations

INESC-TEC, University of Trás-os-Montes and Alto Douro, Vila Real, Portugal
Paulo Moura Oliveira
University of Minho, Braga, Portugal
Paulo Novais
LIACC/UP, University of Porto, Porto, Portugal
Luís Paulo Reis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cardoso, A.B., Martins, B., Estima, J. (2019). Using Recurrent Neural Networks for Toponym Resolution in Text. In: Moura Oliveira, P., Novais, P., Reis, L. (eds) Progress in Artificial Intelligence. EPIA 2019. Lecture Notes in Computer Science(), vol 11805. Springer, Cham. https://doi.org/10.1007/978-3-030-30244-3_63

Download citation

DOI: https://doi.org/10.1007/978-3-030-30244-3_63
Published: 30 August 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30243-6
Online ISBN: 978-3-030-30244-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics