skip to main content
10.1145/3152896.3152897acmconferencesArticle/Chapter ViewAbstractPublication PagesconextConference Proceedingsconference-collections
research-article

Filtering informative tweets during emergencies: a machine learning approach

Published:11 December 2017Publication History

ABSTRACT

Thanks to their worldwide extension and speed, online social networks have become a common and effective way of communication throughout emergencies. The messages posted during a disaster may be either crisis-relevant (alerts, help requests, damage descriptions, etc.) or not (feelings, opinions, etc.) In this paper, we propose a machine learning approach for creating a classifier able to distinguish between informative and not informative messages, and to understand common patterns inside these two classes. We also investigate similarities and differences in the words that mostly occur across three different natural disasters: fire, earthquake and floods. The results, obtained with real data extracted from Twitter during past emergency events, demonstrate the viability of our approach in providing a filtering service able to deliver only informative contents to crisis managers in a view of improving the operational picture during emergency situations.

References

  1. A. Olteanu, S. Vieweg, and C. Castillo. What to expect when the unexpected happens: Social media communications across crises. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work and Social Computing, CSCW '15, pages 994--1009, New York, NY, USA, 2015. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. Castillo. Big Crisis Data: Social Media in Disasters and Time-Critical Situations. Cambridge University Press, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Mendoza, B. Poblete, and C. Castillo. Twitter under crisis: can we trust what we rt? In Proceedings of the First Workshop on Social Media Analytics, SOMA '10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Jackoway, H. Samet, and J. Sankaranarayanan. Identification of live news events using twitter. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN '11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bernhard et .al Klein. Emergency Event Detection in Twitter Streams Based on Natural Language Processing, pages 239--246. Springer International Publishing, 2013.Google ScholarGoogle Scholar
  6. Cornelia Caragea, Nathan McNeese, Anuj Jaiswal, Greg Traylor, Hyun-Woo Kim, Prasenjit Mitra, Dinghao Wu, Andrea H. Tapia, Lee Giles, Bernard J. Jansen, and John Yen. Classifying text messages for the haiti earthquake. In Proceedings of the 8th International ISCRAM Conference, 2011.Google ScholarGoogle Scholar
  7. J. Longhini. Automatic classification of social media streams during emergencies: a twitter case study, 2016.Google ScholarGoogle Scholar
  8. F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida. Detecting spammers on twitter. In Proceedings of the Collaboration, Electronic messaging, Anti-Abuse adn Spam Conference, CEAS '10, 2010.Google ScholarGoogle Scholar
  9. M. Jenders, G. Kasneci, and F. Naumann. Analyzing and predicting viral tweets. In Internation World Wide Web Conference, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. Castillo, M. Mendoza, and B. Poblete. Information credibility on twitter. In Internation World Wide Web Conference, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Zaman, E. Fox, and E. Bradlow. A bayesian approach for predicting the popularity of tweets. Annals of Applied Statistics, 8(3):1583--1611, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  12. I. Feinerer. Introduction to the tm package: Text mining in r, 2017.Google ScholarGoogle Scholar
  13. M. Kuhn. A short introduction to the caret package, 2016.Google ScholarGoogle Scholar
  14. m.eik michalke. Using the korpus package for text analysis, 2017.Google ScholarGoogle Scholar
  15. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems, NIPS'13, pages 3111--3119, USA, 2013. Curran Associates Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Filtering informative tweets during emergencies: a machine learning approach

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        I-TENDER '17: Proceedings of the First CoNEXT Workshop on ICT Tools for Emergency Networks and DisastEr Relief
        December 2017
        53 pages
        ISBN:9781450354240
        DOI:10.1145/3152896

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 December 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader