skip to main content
10.1145/3209978.3210030acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs

Published: 27 June 2018 Publication History

Abstract

In recent times, humanitarian organizations increasingly rely on social media to search for information useful for disaster response. These organizations have varying information needs ranging from general situational awareness (i.e., to understand a bigger picture) to focused information needs e.g., about infrastructure damage, urgent needs of affected people. This research proposes a novel approach to help crisis responders fulfill their information needs at different levels of granularities. Specifically, the proposed approach presents simple algorithms to identify sub-events and generate summaries of big volume of messages around those events using an Integer Linear Programming (ILP) technique. Extensive evaluation on a large set of real world Twitter dataset shows (a). our algorithm can identify important sub-events with high recall (b). the summarization scheme shows (6---30%) higher accuracy of our system compared to many other state-of-the-art techniques. The simplicity of the algorithms ensures that the entire task is done in real time which is needed for practical deployment of the system.

References

[1]
Dhekar Abhik and Durga Toshniwal. 2013. Sub-event detection during natural hazards using features of social media data. In Proceedings of the 22nd International Conference on World Wide Web. ACM, 783--788.
[2]
Allison Badgett and Ruihong Huang. 2016. Extracting Subevents via an Effective Two-phase Approach. In EMNLP. 906--911.
[3]
David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022.
[4]
Dongfeng Cai, Yonghua Hu, Xuelei Miao, and Yan Song. 2009. Dependency Grammar Based English Subject-Verb Agreement Evaluation. In PACLIC. Citeseer, 63--71. {5} Mark A. Cameron, Robert Power, Bella Robinson, and Jie Yin. 2012. Emergency Situation Awareness from Twitter for Crisis Management. In Proc. WWW. ACM, 695--698.
[5]
Carlos Castillo. 2016. Big Crisis Data: Social Media in Disasters and Time-Critical Situations (1st ed.). Cambridge University Press, New York, NY, USA.
[6]
Gunes Erkan and Dragomir R. Radev. 2004. LexRank:Graph-based lexical centrality as salience in text summarization. Artificial Intelligence Research 22 (2004), 457--479.
[7]
Huiji Gao, Geoffrey Barbier, and Rebecca Goolsby. 2011. Harnessing the Crowdsourcing Power of Social Media for Disaster Relief. Intelligent Systems, IEEE 26, 3 (2011), 10--14.
[8]
Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, and Noah Smith, A. 2011. Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments. In Proc. ACL.
[9]
gurobi 2015. Gurobi -- The overall fastest and best supported solver available. http://www.gurobi.com/.
[10]
Muhammad Imran, Carlos Castillo, Fernando Diaz, and Sarah Vieweg. 2015. Processing social media messages in mass emergency: a survey. ACM Computing Surveys (CSUR) 47, 4 (2015), 67.
[11]
Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier, and Sarah Vieweg. 2014. Aidr: Artificial intelligence for disaster response. In Proc. WWW companion. 159--162.
[12]
Chris Kedzie, Kathleen McKeown, and Fernando Diaz. 2015. Predicting Salient Updates for Disaster Summarization. In Proc. ACL. Beijing, China, 1608--1617.
[13]
Lingpeng Kong, Nathan Schneider, Swabha Swayamdipta, Archna Bhatia, Chris Dyer, and Noah A. Smith. 2014. A Dependency Parser for Tweets. In Proc. EMNLP.
[14]
Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Proc. Workshop on Text Summarization Branches Out (with ACL).
[15]
Polykarpos Meladianos, Giannis Nikolentzos, François Rousseau, Yannis Stavrakas, and Michalis Vazirgiannis. 2015. Degeneracy-based real-time sub-event detection in twitter stream. In Proc. AAAI ICWSM. 248--257.
[16]
Minh-Tien Nguyen, Asanobu Kitamoto, and Tri-Thanh Nguyen. 2015. TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets during a Disaster for Reaction. In Proc. PAKDD.
[17]
Miles Osborne, Sean Moran, Richard McCreadie, Alexander Von Lunen, Martin Sykora, Elizabeth Cano, Neil Ireson, Craig Macdonald, Iadh Ounis, Yulan He, Tom Jackson, Fabio Ciravegna, and Ann OBrien. 2014. Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media. In Proc. ACL.
[18]
Patrick Pantel and Dekang Lin. 2002. Discovering word senses from text. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 613--619.
[19]
Daniela Pohl, Abdelhamid Bouchachia, and Hermann Hellwagner. 2012. Automatic sub-event detection in emergency management using social media. In Proc. WWW. ACM, 683--686.
[20]
Koustav Rudra, Siddhartha Banerjee, Niloy Ganguly, Pawan Goyal, Muhammad Imran, and Prasenjit Mitra. 2016. Summarizing Situational Tweets in Crisis Scenario. In Proceedings of the 27th ACM Conference on Hypertext and Social Media. ACM, 137--147. {22} Koustav Rudra, Subham Ghosh, Niloy Ganguly, Pawan Goyal, and Saptarshi Ghosh. 2015. Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach. In Proc. CIKM.
[21]
Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes Twitter users: real-time event detection by social sensors. In Proc. WWW. 851--860.
[22]
Lidan Shou, Zhenhua Wang, Ke Chen, and Gang Chen. 2013. Sumblr: Continuous Summarization of Evolving Tweet Streams. In Proc. ACM SIGIR. 533--542.
[23]
summary-matrix 2017. Summary Matrix - Kemeny-Young method. https: //en.wikipedia.org/wiki/Kemeny-Young_method.
[24]
Lynda Tamine, Laure Soulier, Lamjed Ben Jabeur, Frederic Amblard, Chihab Hanachi, Gilles Hubert, and Camille Roth. 2016. Social Media-Based Collaborative Information Access: Analysis of Online Crisis-Related Twitter Conversations. In ACM 27th Conference on Hypertext & Social Media.
[25]
Istvan Varga, Motoki Sano, Kentaro Torisawa, Chikara Hashimoto, Kiyonori Ohtake, Takao Kawai, Jong-Hoon Oh, and Stijn De Saeger. 2013. Aid is Out There: Looking for Help from Tweets during a Large Scale Disaster. In Proc. ACL.
[26]
Sudha Verma, Sarah Vieweg, William J. Corvey, Leysia Palen, James H. Martin, Martha Palmer, Aaron Schram, and Kenneth M. Anderson. 2011. Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency. In Proc. AAAI ICWSM.
[27]
Sarah Vieweg, Carlos Castillo, and Muhammad Imran. 2014. Integrating social media communications into the rapid assessment of sudden onset disasters. In Social Informatics. Springer, 444--461.
[28]
Xiaohui Yan, Jiafeng Guo, Yanyan Lan, and Xueqi Cheng. 2013. A biterm topic model for short texts. In Proc. WWW. ACM, 1445--1456.

Cited By

View all
  • (2024)Performance Analysis of Tweet Summarization Techniques Considering Crisis DynamicsProceedings of the 25th International Conference on Distributed Computing and Networking10.1145/3631461.3631951(418-423)Online publication date: 4-Jan-2024
  • (2024)Human vs ChatGPT: Effect of Data Annotation in Interpretable Crisis-Related Microblog ClassificationProceedings of the ACM Web Conference 202410.1145/3589334.3648141(4534-4543)Online publication date: 13-May-2024
  • (2024)D$i$E$v$D: Disruptive Event Detection From Dynamic Datastreams Using Continual Machine Learning: A Case Study With TwitterIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2023.327297312:3(727-738)Online publication date: Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
June 2018
1509 pages
ISBN:9781450356572
DOI:10.1145/3209978
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. class-based summarization
  2. high-level summarization
  3. humanitarian classes
  4. situational information
  5. sub-event detection

Qualifiers

  • Research-article

Funding Sources

  • Army Research Lab

Conference

SIGIR '18
Sponsor:

Acceptance Rates

SIGIR '18 Paper Acceptance Rate 86 of 409 submissions, 21%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)2
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Performance Analysis of Tweet Summarization Techniques Considering Crisis DynamicsProceedings of the 25th International Conference on Distributed Computing and Networking10.1145/3631461.3631951(418-423)Online publication date: 4-Jan-2024
  • (2024)Human vs ChatGPT: Effect of Data Annotation in Interpretable Crisis-Related Microblog ClassificationProceedings of the ACM Web Conference 202410.1145/3589334.3648141(4534-4543)Online publication date: 13-May-2024
  • (2024)D$i$E$v$D: Disruptive Event Detection From Dynamic Datastreams Using Continual Machine Learning: A Case Study With TwitterIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2023.327297312:3(727-738)Online publication date: Jul-2024
  • (2024)Extracting the Full Story: A Multimodal Approach and Dataset to Crisis Summarization in TweetsIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.343669011:6(7846-7856)Online publication date: Dec-2024
  • (2024)A Trustworthy Approach to Classify and Analyze Epidemic-Related Information From MicroblogsIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.339139511:5(6229-6241)Online publication date: Oct-2024
  • (2024)Online Summarization of Microblog Data: An Aid in Handling Disaster SituationsIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.334752011:3(4029-4039)Online publication date: Jun-2024
  • (2024)OntoDSumm: Ontology-Based Tweet Summarization for Disaster EventsIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326602511:2(2724-2739)Online publication date: Apr-2024
  • (2024) GenSumm : A Joint Framework for Multi-Task Tweet Classification and Summarization Using Sentiment Analysis and Generative Modelling IEEE Transactions on Affective Computing10.1109/TAFFC.2021.313151615:4(1838-1855)Online publication date: Oct-2024
  • (2024)Multitask Sentiment Analysis Integrated Machine Learning Framework to Classify Informative Tweets From Multidomains2024 Second International Conference on Inventive Computing and Informatics (ICICI)10.1109/ICICI62254.2024.00101(584-589)Online publication date: 11-Jun-2024
  • (2024)Exploration of Topic Relevance in Summaries for Re-summarizing using LLM in Crisis Chronology2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT61001.2024.10725161(1-7)Online publication date: 24-Jun-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media