Abstract
Toponym resolution is the task of disambiguating place names and other references to places in textual documents, resolving them to unambiguous geographical identifiers (e.g., geographic coordinates of latitude and longitude). A major challenge in this task is that place names are often highly ambiguous (e.g., several locations on the surface of the Earth share the same name). In this paper, we propose to address the task through a recurrent neural network architecture with multiple inputs and outputs, specifically leveraging pre-trained contextual embeddings (ELMo) and bi-directional Long Short-Term Memory (LSTM) units, both commonly used for modeling textual data. The proposed model was tested on two datasets previously used to evaluate toponym resolution systems, namely the War of the Rebellion and the Local-Global Lexicon corpora. The results improve on the previous state of the art, confirming the advantage of the proposed method over earlier approaches.
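To make the ambiguity problem concrete, the following minimal Python sketch shows one place name mapping to several candidate locations and a point estimate being used to choose among them. It is purely illustrative and is not the model proposed in this paper: the toy gazetteer, its approximate coordinates, and the function names `haversine_km` and `nearest_candidate` are all assumptions introduced here, and the abstract does not state that the proposed architecture uses a gazetteer lookup of this kind.

```python
import math

# Hypothetical toy gazetteer (an assumption for illustration only):
# each name maps to candidate (label, latitude, longitude) entries,
# with approximate coordinates in decimal degrees.
TOY_GAZETTEER = {
    "Paris": [
        ("Paris, France", 48.8566, 2.3522),
        ("Paris, Texas, USA", 33.6609, -95.5555),
    ],
}


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in kilometres, assuming a spherical Earth."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def nearest_candidate(name, pred_lat, pred_lon, gazetteer=TOY_GAZETTEER):
    """Return the candidate for `name` closest to a predicted point, or None."""
    candidates = gazetteer.get(name, [])
    if not candidates:
        return None
    return min(
        candidates,
        key=lambda c: haversine_km(pred_lat, pred_lon, c[1], c[2]),
    )
```

For example, `nearest_candidate("Paris", 35.0, -97.0)` selects the Texas entry, while a point estimate near northern France selects the French one. In a real system the point estimate would have to come from the textual context surrounding the mention, which is the part a learned model is meant to supply.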
Acknowledgments
This research was supported by Fundação para a Ciência e Tecnologia (FCT), through the project grants with references PTDC/EEI-SCR/1743/2014 (Saturn), T-AP HJ-253525 (DigCH), and PTDC/CCI-CIF/32607/2017 (MIMU), as well as through the INESC-ID multi-annual funding from the PIDDAC programme (UID/CEC/50021/2019). We also gratefully acknowledge the support of NVIDIA Corporation, with the donation of two Titan Xp GPUs used in the experiments reported in this paper.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Cardoso, A.B., Martins, B., Estima, J. (2019). Using Recurrent Neural Networks for Toponym Resolution in Text. In: Moura Oliveira, P., Novais, P., Reis, L. (eds) Progress in Artificial Intelligence. EPIA 2019. Lecture Notes in Computer Science, vol 11805. Springer, Cham. https://doi.org/10.1007/978-3-030-30244-3_63
DOI: https://doi.org/10.1007/978-3-030-30244-3_63
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30243-6
Online ISBN: 978-3-030-30244-3
eBook Packages: Computer Science, Computer Science (R0)