Skip to main content

Exploiting Twitter for Informativeness Classification in Disaster Situations

  • Chapter
  • First Online:
Book cover Transactions on Large-Scale Data- and Knowledge-Centered Systems XLV

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 12390))

  • 307 Accesses

Abstract

Disaster management urgently requires mechanisms for achieving situation awareness (SA) in a timely manner, allowing authorities to react in an appropriate way to reduce the impact on affected people and infrastructure. In such situations, no matter if they are human-induced like shootings or natural ones like earthquakes or floods, social media such as Twitter are frequently used communication channels, making them a highly valuable additional data source for enhancing SA. The challenge is, however, to identify out of the tremendous mass of irrelevant and non informative social media data those messages being really “informative”, i.e., contributing to SA in a certain disaster situation. Existing approaches on machine-learning driven informativeness classification most often focus on specific disaster types, such as shootings or floods, thus lacking general applicability and falling short in classification of new disaster events. Therefore, this article puts forward the following three contributions: First, in order to better understand the underlying social media data source, an in-depth analysis of existing Twitter data on 26 different disaster events is provided along temporal, spatial, linguistic, and source dimensions. Second, based thereupon, a cross-domain informativeness classifier is proposed being not focused on specific disaster types but rather allowing for classifications across different types. Third, the applicability of this cross-domain classifier is demonstrated, showing its accuracy compared to other disaster type specific approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.twitter.com.

  2. 2.

    It has to be noted that a considerably shorter pre-version of this article has already been published in Proceeding of the 10th International Conference on Management of Digital EcoSystems. ACM, Tokyo, Japan, Sept. 2018.

  3. 3.

    http://www.crisislex.org/data-collections.html.

  4. 4.

    https://cran.r-project.org/manuals.html.

  5. 5.

    https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html.

  6. 6.

    https://www.nltk.org/.

  7. 7.

    https://pypi.python.org/pypi/langdetect.

  8. 8.

    http://textblob.readthedocs.io/en/dev/index.html.

  9. 9.

    https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.mutual_info_classif.html.

References

  1. Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisc. Rev. Comput. Stat. 2(4), 433–459 (2010)

    Article  Google Scholar 

  2. Acerbo, F., Rossi, C.: Filtering informative tweets during emergencies: a machine learning approach. In: Proceedings of the 1st CoNEXT Workshop on ICT Tools for Emergency Networks and Disaster Relief, I-TENDER 2017, pp. 1–6. ACM, New York (2017)

    Google Scholar 

  3. Avvenuti, M., Cimino, M.G.C.A., Cresci, S., Marchetti, A., Tesconi, M.: A framework for detecting unfolding emergencies using humans as sensors. SpringerPlus 5(1), 1–23 (2016). https://doi.org/10.1186/s40064-016-1674-y

    Article  Google Scholar 

  4. Cameron, M., Power, R., Robinson, B., Yin, J.: Emergency situation awareness from twitter for crisis management. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 695–698. ACM, New York (2012)

    Google Scholar 

  5. Cresci, S., Tesconi, M., Cimino, A., Dell’Orletta, F.: A linguistically-driven approach to cross-event damage assessment of natural disasters from social media messages. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pp. 1195–1200. ACM (2015)

    Google Scholar 

  6. Dai, W., Xue, G., Yang, Q., Yu, Y.: Transferring Naive Bayes classifiers for text classification. In: Proceedings of the 22nd International Conference on Association for the Advancement of Artificial Intelligence, AAAI 2007, vol. 7, pp. 540–545 (2007)

    Google Scholar 

  7. Derczynski, L., Meesters, K., Bontcheva, K., Maynard, D.: Helping crisis responders find the informative needle in the tweet haystack. arXiv preprint arXiv:1801.09633 (2018)

  8. Girtelschmid, S., Salfinger, A., Pröll, B., Retschitzegger, W., Schwinger, W.: Near real-time detection of crisis situations. In: Proceedings of 39th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2016, pp. 247–252. IEEE (2016)

    Google Scholar 

  9. Horn, C., Zhila, A., Gelbukh, A., Kern, R., Lex, E.: Using factual density to measure informativeness of web documents. In: Proceedings of the 19th Nordic Conference of Computational Linguistics, NODALIDA 2013, pp. 227–238 (2013)

    Google Scholar 

  10. Imran, M., Castillo, C., Diaz, F., Vieweg, S.: Processing social media messages in mass emergency: a survey. ACM Comput. Surv. (CSUR) 47, 1–38 (2015)

    Article  Google Scholar 

  11. Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., Meier, P.: Extracting information nuggets from disaster-related messages in social media. In: Proceedings of the 10th Conference for Information Systems for Crisis Response and Management, ISCRAM 2013 (2013)

    Google Scholar 

  12. Imran, M., Mitra, P., Srivastava, J.: Cross-language domain adaptation for classifying crisis-related short messages. arXiv preprint arXiv:1602.05388 (2016)

  13. Khare, P., Burel, G., Alani, H.: Classifying crises-information relevancy with semantics. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 367–383. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_24

    Chapter  Google Scholar 

  14. Khare, P., Fernandez, M., Alani, H.: Statistical semantic classification of crisis information. In: 1st workshop of Hybrid Statistical Semantic Understanding and Emerging Semantics (HSSUES), 16th International Semantic Web Conference (2017)

    Google Scholar 

  15. Khare, P., Burel, G., Maynard, D., Alani, H.: Cross-lingual classification of crisis data. In: Vrandečić, D.D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 617–633. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_36

    Chapter  Google Scholar 

  16. Kireyev, K.: Semantic-based estimation of term informativeness. In: Proceedings of the 2009 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2009, pp. 530–538. Association for Computational Linguistics (2009)

    Google Scholar 

  17. Li, H., et al.: Twitter mining for disaster response: a domain adaptation approach. In: Proceedings of the 12th Conference for Information Systems for Crisis Response and Management, ISCRAM 2015 (2015)

    Google Scholar 

  18. Lloret, E., Palomar, M.: Analysing and evaluating the task of automatic tweet generation: knowledge to business. Comput. Ind. 78, 3–15 (2016)

    Article  Google Scholar 

  19. Longhini, J., Rossi, C., Casetti, C., Angaramo, F.: A language-agnostic approach to exact informative tweets during emergency situations. In: International Conference on Big Data, Big Data 2017, pp. 3475–3739. IEEE (2017)

    Google Scholar 

  20. Mohammad, S., Kiritchenko, S., Zhu, X.: NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242 (2013)

  21. Ning, X., Yao, L., Wang, X., Benatallah, B.: Calling for response: automatically distinguishing situation-aware tweets during crises. In: Cong, G., Peng, W.-C., Zhang, W.E., Li, C., Sun, A. (eds.) ADMA 2017. LNCS (LNAI), vol. 10604, pp. 195–208. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69179-4_14

    Chapter  Google Scholar 

  22. Olteanu, A., Vieweg, S., Castillo, C.: What to expect when the unexpected happens: social media communications across crises. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW 2015, pp. 994–1009. ACM (2015)

    Google Scholar 

  23. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  24. Pekar, V., Binner, J., Najafi, H., Hale, C.: Selecting classification features for detection of mass emergency events on social media. In: Proceedings of the 15th International Conference on Security and Management, SAM 2016, The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp), p. 192 (2016)

    Google Scholar 

  25. Pekar, V., Binner, J., Najafi, H., Hale, C., Schmidt, V.: Early detection of heterogeneous disaster events using social media. J. Assoc. Inf. Sci. Technol. 71, 43–54 (2020)

    Article  Google Scholar 

  26. Ren, X., et al.: CoType: joint extraction of typed entities and relations with knowledge bases. In: Proceedings of the 26th International Conference on World Wide Web, WWW 2017, International World Wide Web Conference on Steering Committee, pp. 1015–1024 (2017)

    Google Scholar 

  27. Rossi, C., et al.: Early detection and information extraction for weather-induced floods using social media streams. Int. J. Disaster Risk Reduct. 30, 145–157 (2018)

    Article  Google Scholar 

  28. Salfinger, A.: Staying aware in an evolving world. Ph.D. thesis, Johannes Kepler University of Linz (2016)

    Google Scholar 

  29. Salfinger, A., Salfinger, C., Pröll, B., Retschitzegger, W., Schwinger, W.: Pinpointing the eye of the hurricane-creating a gold-standard corpus for situative geo-coding of crisis tweets based on linked open data. In: LDL 2016 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources, p. 27 (2016)

    Google Scholar 

  30. Salfinger, A., Schwinger, W., Retschitzegger, W., Pröll, B.: Mining the disaster hotspots-situation-adaptive crowd knowledge extraction for crisis management. In: Proceedings of the 2016 Multi-Disciplinary International Conference on Cognitive Methods in Situation Awareness and Decision Support, CogSIMA 2016, pp. 212–218. IEEE (2016)

    Google Scholar 

  31. Stowe, K., Paul, M., Palmer, M., Palen, L., Anderson, K.: Identifying and categorizing disaster-related tweets. In: Proceedings of The 4th International Workshop on Natural Language Processing for Social Media, pp. 1–6 (2016)

    Google Scholar 

  32. Tsuchida, T., Kato, D., Endo, M., Hirota, M., Araki, T., Ishikawa, H.: Analyzing Relationship of words using biased LexRank from geotagged tweets. In: Proceedings of the 9th International Conference on Management of Digital Ecosystems, MEDES 2017, pp. 42–49. ACM, New York (2017)

    Google Scholar 

  33. Verma, S., et al.: Natural language processing to the rescue? Extracting “situational awareness” tweets during mass emergency. In: Proceedings of the 5th Conference on Weblogs and Social Media, ICWSM 2011 (2011)

    Google Scholar 

  34. Vieweg, S.: Situational awareness in mass emergency: a behavioral and linguistic analysis of microblogged communications. Ph.D. thesis, University of Colorado at Boulder (2012)

    Google Scholar 

  35. Vieweg, S., Hughes, A.L., Starbird, K., Palen, L.: Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the Conference on Human Factors in Computing Systems, CHI 2010, pp. 1079–1088. ACM (2010)

    Google Scholar 

  36. Wong, B., Kit, C.: Comparative evaluation of term informativeness measures for machine translation evaluation metrics. In: Proceedings of the 13th Conference of Machine Translation Summit, vol. 2011, pp. 537–544 (2011)

    Google Scholar 

  37. Wu, Z., Giles, C.: Measuring term informativeness in context. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013, pp. 259–269 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Graf .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer-Verlag GmbH Germany, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Graf, D., Retschitzegger, W., Schwinger, W., Pröll, B., Kapsammer, E. (2020). Exploiting Twitter for Informativeness Classification in Disaster Situations. In: Hameurlain, A., et al. Transactions on Large-Scale Data- and Knowledge-Centered Systems XLV. Lecture Notes in Computer Science(), vol 12390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-62308-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-62308-4_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-62307-7

  • Online ISBN: 978-3-662-62308-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics