Location Extraction from Twitter Messages Using a Bidirectional Long Short-Term Memory Neural Network with Conditional Random Field Model

Chen, Zi; Pokharel, Badal; Li, Bingnan; Lim, Samsung

doi:10.1007/978-3-030-76374-9_2

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1411))

Included in the following conference series:

International Conference on Geographical Information Systems Theory, Applications and Management

315 Accesses
1 Citations

Abstract

In the context of disaster management, location information is crucial in disaster scenarios to infer the incident location and facilitate disaster relief. In recent years the advent of social media has brought not only great opportunity to enhance disaster management in a crowdsourced perspective, but also a major challenge to interpret the noisy information. A conventional approach to location extraction from texts is Named Entity Recognition (NER), however it shows unsatisfactory performance on informal and colloquial texts such as social media messages, especially for the uncommon place names. To address this issue, we proposed a Bidirectional Long Short-Term Memory (LSTM) Neural Network with Conditional Random Field (CRF) layer to identify geo-entities especially the rarely known local places in social media messages, and the use of orthographic, semantic and syntactic features was explored to achieve best performance. The proposed model was tested on a dataset collected from Twitter, showing promising performance in detecting location information when compared with off-the-shelf NER tools.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ahmed, A.: Use of social media in disaster management. In: International Conference on Information Systems, ICIS 2011. pp. 4149–4159 (2011)
Google Scholar
Chatfield, A.T., Brajawidagda, U.: Twitter early tsunami warning system: a case study in Indonesia’s natural disaster management. In: Proceedings of the Annual Hawaii International Conference on System Science, pp. 2050–2060 (2013)
Google Scholar
Nakaji, Y., Yanai, K.: Visualization of real-world events with geotagged tweet photos. In: Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, ICMEW, pp. 272–277. ICMEW (2012)
Google Scholar
Sultanik, E.A., Fink, C.: Rapid geotagging and disambiguation of social media text via an indexed gazetteer. In: ISCRAM 2012 Conference Proceedings - 9th International Conference on Information Systems for Crisis Response and Management (2012)
Google Scholar
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: an architecture for development of robust HLT applications. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, p. 1688 (2001). https://doi.org/10.3115/1073083.1073112
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. pp. 363–370 (2005)
Google Scholar
Kazama, J., Torisawa, K.: Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In: Proceedings of the Conference of 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT, ACL 2008. pp. 407–415 (2008)
Google Scholar
Downey, D., Broadhead, M., Etzioni, O.: Locating complex named entities in web text. In: IJCAI International Joint Conference on Artificial Intelligence. pp. 2733 –2739 (2007)
Google Scholar
Bontcheva, K., Derczynski, L., Funk, A., Greenwood, M. A., Maynard, D., Aswani, N.: Twitie: An open-source information extraction pipeline for microblog text. In: Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013, 83–90 (2013)
Google Scholar
Lingad, J., Karimi, S., Yin, J.: Location extraction from disaster-related microblogs. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013 Companion. pp. 1017–1020. ACM Press, New York (2013)
Google Scholar
Awan, Z., Kahlke, T., Ralph, P.J., Kennedy, P.J.: Chemical named entity recognition with deep contextualized neural embeddings. In: IC3K 2019 - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. pp. 135–144 (2019)
Google Scholar
Rachman, V., Savitri, S., Augustianti, F., Mahendra, R.: Named entity recognition on Indonesian Twitter posts using long short-term memory networks. In: 2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017. pp. 228–232 (2018)
Google Scholar
Hoang, T.B.N., Mothe, J.: Location extraction from tweets. Inf. Process. Manage. 54, 129–144 (2018). https://doi.org/10.1016/j.ipm.2017.11.001
Article Google Scholar
Al-Olimat, H.S., Thirunarayan, K., Shalin, V., Sheth, A.: Location name extraction from targeted text streams using gazetteer-based statistical language models. arXiv preprint arXiv: 1708.03105 (2017)
Chiu, J.P.C., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016). https://doi.org/10.1162/tacl_a_00104
Article Google Scholar
Harris, Z.S.: Distributional structure. Distrib. Struct. Word. 10, 146–162 (1954). https://doi.org/10.1080/00437956.1954.11659520
Article Google Scholar
Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, EMNLP 2014. pp. 1532–1543 (2014)
Google Scholar
Chen, Z., Pokharel, B., Li, B., Lim, S.: Location extraction from twitter messages using bidirectional long short-term memory model. In: Proceedings of the 6th International Conference on Geographical Information Systems Theory, Applications and Management. SCITEPRESS - Science and Technology Publications. pp. 45–50 (2020)
Google Scholar
Ritter, A., Sam, C., Mausam, E.O.: Named entity recognition in tweets: an experimental study. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, pp. 1524–1534. Association for Computational Linguistics (2011)
Google Scholar
Mishra, S., Diesner, J.: Semi-supervised named entity recognition in noisy-text. In: Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT). The COLING 2016 Organizing Committee. pp. 203–212 (2016)
Google Scholar
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009. pp. 248–256 (2009)
Google Scholar

Download references

Acknowledgements

This research is sponsored by China Scholarship Council (CSC).

Author information

Authors and Affiliations

School of Civil and Environmental Engineering, University of New South Wales, Sydney, Australia
Zi Chen, Badal Pokharel, Bingnan Li & Samsung Lim

Authors

Zi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Badal Pokharel
View author publications
You can also search for this author in PubMed Google Scholar
Bingnan Li
View author publications
You can also search for this author in PubMed Google Scholar
Samsung Lim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zi Chen .

Editor information

Editors and Affiliations

Polytechnic Institute of Setúbal, Setúbal, Portugal
Cédric Grueau
Knowledge Systems Institute, Skokie, IL, USA
Robert Laurini
Technical University of Crete, Chania, Greece
Lemonia Ragia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, Z., Pokharel, B., Li, B., Lim, S. (2021). Location Extraction from Twitter Messages Using a Bidirectional Long Short-Term Memory Neural Network with Conditional Random Field Model. In: Grueau, C., Laurini, R., Ragia, L. (eds) Geographical Information Systems Theory, Applications and Management. GISTAM 2020. Communications in Computer and Information Science, vol 1411. Springer, Cham. https://doi.org/10.1007/978-3-030-76374-9_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-76374-9_2
Published: 18 May 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-76373-2
Online ISBN: 978-3-030-76374-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics