Exploiting Twitter for Informativeness Classification in Disaster Situations

Graf, David; Retschitzegger, Werner; Schwinger, Wieland; Pröll, Birgit; Kapsammer, Elisabeth

doi:10.1007/978-3-662-62308-4_2

David Graf¹⁵,
Werner Retschitzegger¹⁵,
Wieland Schwinger¹⁵,
Birgit Pröll¹⁶ &
…
Elisabeth Kapsammer¹⁵

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 12390))

307 Accesses

Abstract

Disaster management urgently requires mechanisms for achieving situation awareness (SA) in a timely manner, allowing authorities to react in an appropriate way to reduce the impact on affected people and infrastructure. In such situations, no matter if they are human-induced like shootings or natural ones like earthquakes or floods, social media such as Twitter are frequently used communication channels, making them a highly valuable additional data source for enhancing SA. The challenge is, however, to identify out of the tremendous mass of irrelevant and non informative social media data those messages being really “informative”, i.e., contributing to SA in a certain disaster situation. Existing approaches on machine-learning driven informativeness classification most often focus on specific disaster types, such as shootings or floods, thus lacking general applicability and falling short in classification of new disaster events. Therefore, this article puts forward the following three contributions: First, in order to better understand the underlying social media data source, an in-depth analysis of existing Twitter data on 26 different disaster events is provided along temporal, spatial, linguistic, and source dimensions. Second, based thereupon, a cross-domain informativeness classifier is proposed being not focused on specific disaster types but rather allowing for classifications across different types. Third, the applicability of this cross-domain classifier is demonstrated, showing its accuracy compared to other disaster type specific approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.twitter.com.
2.
It has to be noted that a considerably shorter pre-version of this article has already been published in Proceeding of the 10^th International Conference on Management of Digital EcoSystems. ACM, Tokyo, Japan, Sept. 2018.
3.
http://www.crisislex.org/data-collections.html.
4.
https://cran.r-project.org/manuals.html.
5.
https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html.
6.
https://www.nltk.org/.
7.
https://pypi.python.org/pypi/langdetect.
8.
http://textblob.readthedocs.io/en/dev/index.html.
9.
https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.mutual_info_classif.html.

References

Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisc. Rev. Comput. Stat. 2(4), 433–459 (2010)
Article Google Scholar
Acerbo, F., Rossi, C.: Filtering informative tweets during emergencies: a machine learning approach. In: Proceedings of the 1st CoNEXT Workshop on ICT Tools for Emergency Networks and Disaster Relief, I-TENDER 2017, pp. 1–6. ACM, New York (2017)
Google Scholar
Avvenuti, M., Cimino, M.G.C.A., Cresci, S., Marchetti, A., Tesconi, M.: A framework for detecting unfolding emergencies using humans as sensors. SpringerPlus 5(1), 1–23 (2016). https://doi.org/10.1186/s40064-016-1674-y
Article Google Scholar
Cameron, M., Power, R., Robinson, B., Yin, J.: Emergency situation awareness from twitter for crisis management. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 695–698. ACM, New York (2012)
Google Scholar
Cresci, S., Tesconi, M., Cimino, A., Dell’Orletta, F.: A linguistically-driven approach to cross-event damage assessment of natural disasters from social media messages. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pp. 1195–1200. ACM (2015)
Google Scholar
Dai, W., Xue, G., Yang, Q., Yu, Y.: Transferring Naive Bayes classifiers for text classification. In: Proceedings of the 22nd International Conference on Association for the Advancement of Artificial Intelligence, AAAI 2007, vol. 7, pp. 540–545 (2007)
Google Scholar
Derczynski, L., Meesters, K., Bontcheva, K., Maynard, D.: Helping crisis responders find the informative needle in the tweet haystack. arXiv preprint arXiv:1801.09633 (2018)
Girtelschmid, S., Salfinger, A., Pröll, B., Retschitzegger, W., Schwinger, W.: Near real-time detection of crisis situations. In: Proceedings of 39th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2016, pp. 247–252. IEEE (2016)
Google Scholar
Horn, C., Zhila, A., Gelbukh, A., Kern, R., Lex, E.: Using factual density to measure informativeness of web documents. In: Proceedings of the 19th Nordic Conference of Computational Linguistics, NODALIDA 2013, pp. 227–238 (2013)
Google Scholar
Imran, M., Castillo, C., Diaz, F., Vieweg, S.: Processing social media messages in mass emergency: a survey. ACM Comput. Surv. (CSUR) 47, 1–38 (2015)
Article Google Scholar
Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., Meier, P.: Extracting information nuggets from disaster-related messages in social media. In: Proceedings of the 10th Conference for Information Systems for Crisis Response and Management, ISCRAM 2013 (2013)
Google Scholar
Imran, M., Mitra, P., Srivastava, J.: Cross-language domain adaptation for classifying crisis-related short messages. arXiv preprint arXiv:1602.05388 (2016)
Khare, P., Burel, G., Alani, H.: Classifying crises-information relevancy with semantics. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 367–383. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_24
Chapter Google Scholar
Khare, P., Fernandez, M., Alani, H.: Statistical semantic classification of crisis information. In: 1st workshop of Hybrid Statistical Semantic Understanding and Emerging Semantics (HSSUES), 16th International Semantic Web Conference (2017)
Google Scholar
Khare, P., Burel, G., Maynard, D., Alani, H.: Cross-lingual classification of crisis data. In: Vrandečić, D.D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 617–633. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_36
Chapter Google Scholar
Kireyev, K.: Semantic-based estimation of term informativeness. In: Proceedings of the 2009 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2009, pp. 530–538. Association for Computational Linguistics (2009)
Google Scholar
Li, H., et al.: Twitter mining for disaster response: a domain adaptation approach. In: Proceedings of the 12th Conference for Information Systems for Crisis Response and Management, ISCRAM 2015 (2015)
Google Scholar
Lloret, E., Palomar, M.: Analysing and evaluating the task of automatic tweet generation: knowledge to business. Comput. Ind. 78, 3–15 (2016)
Article Google Scholar
Longhini, J., Rossi, C., Casetti, C., Angaramo, F.: A language-agnostic approach to exact informative tweets during emergency situations. In: International Conference on Big Data, Big Data 2017, pp. 3475–3739. IEEE (2017)
Google Scholar
Mohammad, S., Kiritchenko, S., Zhu, X.: NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242 (2013)
Ning, X., Yao, L., Wang, X., Benatallah, B.: Calling for response: automatically distinguishing situation-aware tweets during crises. In: Cong, G., Peng, W.-C., Zhang, W.E., Li, C., Sun, A. (eds.) ADMA 2017. LNCS (LNAI), vol. 10604, pp. 195–208. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69179-4_14
Chapter Google Scholar
Olteanu, A., Vieweg, S., Castillo, C.: What to expect when the unexpected happens: social media communications across crises. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW 2015, pp. 994–1009. ACM (2015)
Google Scholar
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar
Pekar, V., Binner, J., Najafi, H., Hale, C.: Selecting classification features for detection of mass emergency events on social media. In: Proceedings of the 15th International Conference on Security and Management, SAM 2016, The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp), p. 192 (2016)
Google Scholar
Pekar, V., Binner, J., Najafi, H., Hale, C., Schmidt, V.: Early detection of heterogeneous disaster events using social media. J. Assoc. Inf. Sci. Technol. 71, 43–54 (2020)
Article Google Scholar
Ren, X., et al.: CoType: joint extraction of typed entities and relations with knowledge bases. In: Proceedings of the 26th International Conference on World Wide Web, WWW 2017, International World Wide Web Conference on Steering Committee, pp. 1015–1024 (2017)
Google Scholar
Rossi, C., et al.: Early detection and information extraction for weather-induced floods using social media streams. Int. J. Disaster Risk Reduct. 30, 145–157 (2018)
Article Google Scholar
Salfinger, A.: Staying aware in an evolving world. Ph.D. thesis, Johannes Kepler University of Linz (2016)
Google Scholar
Salfinger, A., Salfinger, C., Pröll, B., Retschitzegger, W., Schwinger, W.: Pinpointing the eye of the hurricane-creating a gold-standard corpus for situative geo-coding of crisis tweets based on linked open data. In: LDL 2016 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources, p. 27 (2016)
Google Scholar
Salfinger, A., Schwinger, W., Retschitzegger, W., Pröll, B.: Mining the disaster hotspots-situation-adaptive crowd knowledge extraction for crisis management. In: Proceedings of the 2016 Multi-Disciplinary International Conference on Cognitive Methods in Situation Awareness and Decision Support, CogSIMA 2016, pp. 212–218. IEEE (2016)
Google Scholar
Stowe, K., Paul, M., Palmer, M., Palen, L., Anderson, K.: Identifying and categorizing disaster-related tweets. In: Proceedings of The 4th International Workshop on Natural Language Processing for Social Media, pp. 1–6 (2016)
Google Scholar
Tsuchida, T., Kato, D., Endo, M., Hirota, M., Araki, T., Ishikawa, H.: Analyzing Relationship of words using biased LexRank from geotagged tweets. In: Proceedings of the 9th International Conference on Management of Digital Ecosystems, MEDES 2017, pp. 42–49. ACM, New York (2017)
Google Scholar
Verma, S., et al.: Natural language processing to the rescue? Extracting “situational awareness” tweets during mass emergency. In: Proceedings of the 5th Conference on Weblogs and Social Media, ICWSM 2011 (2011)
Google Scholar
Vieweg, S.: Situational awareness in mass emergency: a behavioral and linguistic analysis of microblogged communications. Ph.D. thesis, University of Colorado at Boulder (2012)
Google Scholar
Vieweg, S., Hughes, A.L., Starbird, K., Palen, L.: Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the Conference on Human Factors in Computing Systems, CHI 2010, pp. 1079–1088. ACM (2010)
Google Scholar
Wong, B., Kit, C.: Comparative evaluation of term informativeness measures for machine translation evaluation metrics. In: Proceedings of the 13th Conference of Machine Translation Summit, vol. 2011, pp. 537–544 (2011)
Google Scholar
Wu, Z., Giles, C.: Measuring term informativeness in context. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013, pp. 259–269 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Telecooperation, Department of Cooperative Information Systems, Johannes Kepler University, Linz, Austria
David Graf, Werner Retschitzegger, Wieland Schwinger & Elisabeth Kapsammer
Institute for Application Oriented Knowledge Processing, Johannes Kepler University, Linz, Austria
Birgit Pröll

Authors

David Graf
View author publications
You can also search for this author in PubMed Google Scholar
Werner Retschitzegger
View author publications
You can also search for this author in PubMed Google Scholar
Wieland Schwinger
View author publications
You can also search for this author in PubMed Google Scholar
Birgit Pröll
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Kapsammer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Graf .

Editor information

Editors and Affiliations

IRIT, Paul Sabatier University, Toulouse, France
Abdelkader Hameurlain
Vienna University of Technology, Vienna, Austria
A Min Tjoa
University of Pau and Pays de l'Adour, Anglet, France
Richard Chbeir
Open University of Cyprus, Nicosia, Cyprus
Yannis Manolopoulos
Tokyo Metropolitan University, Tokyo, Japan
Hiroshi Ishikawa
University of Zaragoza, Zaragoza, Spain
Sergio Ilarri
Aristotle University of Thessaloniki, Thessaloniki, Greece
Apostolos Papadopoulos

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Graf, D., Retschitzegger, W., Schwinger, W., Pröll, B., Kapsammer, E. (2020). Exploiting Twitter for Informativeness Classification in Disaster Situations. In: Hameurlain, A., et al. Transactions on Large-Scale Data- and Knowledge-Centered Systems XLV. Lecture Notes in Computer Science(), vol 12390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-62308-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-662-62308-4_2
Published: 20 September 2020
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-62307-7
Online ISBN: 978-3-662-62308-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics