Abstract
In the context of disaster management, location information is crucial in disaster scenarios to infer the incident location and facilitate disaster relief. In recent years the advent of social media has brought not only great opportunity to enhance disaster management in a crowdsourced perspective, but also a major challenge to interpret the noisy information. A conventional approach to location extraction from texts is Named Entity Recognition (NER), however it shows unsatisfactory performance on informal and colloquial texts such as social media messages, especially for the uncommon place names. To address this issue, we proposed a Bidirectional Long Short-Term Memory (LSTM) Neural Network with Conditional Random Field (CRF) layer to identify geo-entities especially the rarely known local places in social media messages, and the use of orthographic, semantic and syntactic features was explored to achieve best performance. The proposed model was tested on a dataset collected from Twitter, showing promising performance in detecting location information when compared with off-the-shelf NER tools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmed, A.: Use of social media in disaster management. In: International Conference on Information Systems, ICIS 2011. pp. 4149–4159 (2011)
Chatfield, A.T., Brajawidagda, U.: Twitter early tsunami warning system: a case study in Indonesia’s natural disaster management. In: Proceedings of the Annual Hawaii International Conference on System Science, pp. 2050–2060 (2013)
Nakaji, Y., Yanai, K.: Visualization of real-world events with geotagged tweet photos. In: Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, ICMEW, pp. 272–277. ICMEW (2012)
Sultanik, E.A., Fink, C.: Rapid geotagging and disambiguation of social media text via an indexed gazetteer. In: ISCRAM 2012 Conference Proceedings - 9th International Conference on Information Systems for Crisis Response and Management (2012)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: an architecture for development of robust HLT applications. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, p. 1688 (2001). https://doi.org/10.3115/1073083.1073112
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. pp. 363–370 (2005)
Kazama, J., Torisawa, K.: Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In: Proceedings of the Conference of 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT, ACL 2008. pp. 407–415 (2008)
Downey, D., Broadhead, M., Etzioni, O.: Locating complex named entities in web text. In: IJCAI International Joint Conference on Artificial Intelligence. pp. 2733 –2739 (2007)
Bontcheva, K., Derczynski, L., Funk, A., Greenwood, M. A., Maynard, D., Aswani, N.: Twitie: An open-source information extraction pipeline for microblog text. In: Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013, 83–90 (2013)
Lingad, J., Karimi, S., Yin, J.: Location extraction from disaster-related microblogs. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013 Companion. pp. 1017–1020. ACM Press, New York (2013)
Awan, Z., Kahlke, T., Ralph, P.J., Kennedy, P.J.: Chemical named entity recognition with deep contextualized neural embeddings. In: IC3K 2019 - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. pp. 135–144 (2019)
Rachman, V., Savitri, S., Augustianti, F., Mahendra, R.: Named entity recognition on Indonesian Twitter posts using long short-term memory networks. In: 2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017. pp. 228–232 (2018)
Hoang, T.B.N., Mothe, J.: Location extraction from tweets. Inf. Process. Manage. 54, 129–144 (2018). https://doi.org/10.1016/j.ipm.2017.11.001
Al-Olimat, H.S., Thirunarayan, K., Shalin, V., Sheth, A.: Location name extraction from targeted text streams using gazetteer-based statistical language models. arXiv preprint arXiv: 1708.03105 (2017)
Chiu, J.P.C., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016). https://doi.org/10.1162/tacl_a_00104
Harris, Z.S.: Distributional structure. Distrib. Struct. Word. 10, 146–162 (1954). https://doi.org/10.1080/00437956.1954.11659520
Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, EMNLP 2014. pp. 1532–1543 (2014)
Chen, Z., Pokharel, B., Li, B., Lim, S.: Location extraction from twitter messages using bidirectional long short-term memory model. In: Proceedings of the 6th International Conference on Geographical Information Systems Theory, Applications and Management. SCITEPRESS - Science and Technology Publications. pp. 45–50 (2020)
Ritter, A., Sam, C., Mausam, E.O.: Named entity recognition in tweets: an experimental study. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, pp. 1524–1534. Association for Computational Linguistics (2011)
Mishra, S., Diesner, J.: Semi-supervised named entity recognition in noisy-text. In: Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT). The COLING 2016 Organizing Committee. pp. 203–212 (2016)
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009. pp. 248–256 (2009)
Acknowledgements
This research is sponsored by China Scholarship Council (CSC).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, Z., Pokharel, B., Li, B., Lim, S. (2021). Location Extraction from Twitter Messages Using a Bidirectional Long Short-Term Memory Neural Network with Conditional Random Field Model. In: Grueau, C., Laurini, R., Ragia, L. (eds) Geographical Information Systems Theory, Applications and Management. GISTAM 2020. Communications in Computer and Information Science, vol 1411. Springer, Cham. https://doi.org/10.1007/978-3-030-76374-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-76374-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-76373-2
Online ISBN: 978-3-030-76374-9
eBook Packages: Computer ScienceComputer Science (R0)