skip to main content
research-article

GeoBurst+: Effective and Real-Time Local Event Detection in Geo-Tagged Tweet Streams

Published: 18 January 2018 Publication History

Abstract

The real-time discovery of local events (e.g., protests, disasters) has been widely recognized as a fundamental socioeconomic task. Recent studies have demonstrated that the geo-tagged tweet stream serves as an unprecedentedly valuable source for local event detection. Nevertheless, how to effectively extract local events from massive geo-tagged tweet streams in real time remains challenging. To bridge the gap, we propose a method for effective and real-time local event detection from geo-tagged tweet streams. Our method, named GeoBurst+, first leverages a novel cross-modal authority measure to identify several pivots in the query window. Such pivots reveal different geo-topical activities and naturally attract similar tweets to form candidate events. GeoBurst+ further summarizes the continuous stream and compares the candidates against the historical summaries to pinpoint truly interesting local events. Better still, as the query window shifts, GeoBurst+ is capable of updating the event list with little time cost, thus achieving continuous monitoring of the stream. We used crowdsourcing to evaluate GeoBurst+ on two million-scale datasets and found it significantly more effective than existing methods while being orders of magnitude faster.

References

[1]
Hamed Abdelhaq, Christian Sengstock, and Michael Gertz. 2013. Eventweet: Online localized event detection from twitter. PVLDB 6, 12 (2013), 1326--1329.
[2]
Charu C. Aggarwal, Jiawei Han, Jianyong Wang, and Philip S. Yu. 2003. A framework for clustering evolving data streams. In VLDB. 81--92.
[3]
Charu C. Aggarwal and Karthik Subbian. 2012. Event detection in social streams. In SDM. 624--635.
[4]
James Allan, Ron Papka, and Victor Lavrenko. 1998. On-line new event detection and tracking. In SIGIR. 37--45.
[5]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3, 1 (2003), 993--1022.
[6]
Ling Chen and Abhishek Roy. 2009. Event detection from flickr data through wavelet-based spatial analysis. In CIKM. 523--532.
[7]
Dorin Comaniciu and Peter Meer. 1999. Mean shift analysis and applications. In ICCV. 1197--1203.
[8]
Son Doan, Bao-Khanh Ho Vo, and Nigel Collier. 2012. An analysis of twitter messages in the 2011 tohoku earthquake. In Electronic Healthcare. Springer, 58--66.
[9]
Wei Feng, Chao Zhang, Wei Zhang, Jiawei Han, Jianyong Wang, Charu Aggarwal, and Jianbin Huang. 2015. STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream. In ICDE. 1561--1572.
[10]
John Foley, Michael Bendersky, and Vanja Josifovski. 2015. Learning to extract local events from the web. In SIGIR. 423--432.
[11]
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Philip S. Yu, and Hongjun Lu. 2005. Parameter free bursty events detection in text streams. In VLDB. 181--192.
[12]
Kevin Gimpel, Nathan Schneider, Brendan O’Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, and Noah A. Smith. 2011. Part-of-speech tagging for twitter: Annotation, features, and experiments. In ACL. 42--47.
[13]
Jinjin Guo and Zhiguo Gong. 2016. A nonparametric model for event discovery in the geospatial-temporal space. In CIKM. 499--508.
[14]
Qi He, Kuiyu Chang, and Ee-Peng Lim. 2007. Analyzing feature trajectories for event detection. In SIGIR. 207--214.
[15]
Liangjie Hong, Amr Ahmed, Siva Gurumurthy, Alexander J. Smola, and Kostas Tsioutsiouliklis. 2012. Discovering geographical topics in the twitter stream. In WWW. 769--778.
[16]
Glen Jeh and Jennifer Widom. 2003. Scaling personalized web search. In WWW. 271--279.
[17]
Wei Kang, Anthony K. H. Tung, Wei Chen, Xinyu Li, Qiyue Song, Chao Zhang, Feng Zhao, and Xiajuan Zhou. 2014. Trendspedia: An Internet observatory for analyzing and visualizing the evolving web. In ICDE. 1206--1209.
[18]
Christoph Carl Kling, Jérôme Kunegis, Sergej Sizov, and Steffen Staab. 2014. Detecting non-Gaussian geographical topics in tagged photo collections. In WSDM. 603--612.
[19]
John Krumm and Eric Horvitz. 2015. Eyewitness: Identifying local events via space-time signals in twitter feeds. In SIGSPATIAL.
[20]
Quoc V. Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In ICML, vol. 14. 1188--1196.
[21]
Chenliang Li, Aixin Sun, and Anwitaman Datta. 2012. Twevent: Segment-based event detection from tweets. In CIKM. 155--164.
[22]
Rui Li, Kin Hou Lei, Ravi Khadiwala, and KC-C. Chang. 2012. Tedas: A twitter-based event detection and analysis system. In ICDE. 1273--1276.
[23]
Peter Lofgren and Ashish Goel. 2013. Personalized pagerank to a target node. arXiv:1304.4658 (2013).
[24]
Michael Mathioudakis and Nick Koudas. 2010. Twittermonitor: Trend detection over the twitter stream. In SIGMOD. 1155--1158.
[25]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Gregory S. Corrado, and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. In NIPS. 3111--3119.
[26]
Sathappan Muthiah, Patrick Butler, Rupinder Paul Khandpur, Parang Saraf, Nathan Self, Alla Rozovskaya, Liang Zhao, Jose Cadena, Chang-Tien Lu, Anil Vullikanti, Achla Marathe, Kristen Maria Summers, Graham Katz, Andy Doyle, Jaime Arredondo, Dipak K. Gupta, David Mares, and Naren Ramakrishnan. 2016. EMBERS at 4 years: Experiences operating an open source indicators forecasting system. In KDD. 205--214.
[27]
Swit Phuvipadawat and Tsuyoshi Murata. 2010. Breaking news detection and tracking in twitter. In WI-IAT. 120--123.
[28]
Mauricio Quezada, Vanessa Peña-Araya, and Barbara Poblete. 2015. Location-aware model for news events in social media. In SIGIR. 935--938.
[29]
Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes twitter users: Real-time event detection by social sensors. In WWW. 851--860.
[30]
Jagan Sankaranarayanan, Hanan Samet, Benjamin E. Teitler, Michael D. Lieberman, and Jon Sperling. 2009. Twitterstand: News in tweets. In GIS. 42--51.
[31]
Sergej Sizov. 2010. GeoFolk: Latent spatial semantics in web 2.0 social media. In WSDM. 281--290.
[32]
Kazufumi Watanabe, Masanao Ochi, Makoto Okabe, and Rikio Onai. 2011. Jasmine: A real-time local-event detection system based on geolocation information propagated to microblogs. In CIKM. 2541--2544.
[33]
Jianshu Weng and Bu-Sung Lee. 2011. Event detection in twitter. In ICWSM. 401--408.
[34]
Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang Zhai, and Thomas S. Huang. 2011. Geographical topic discovery and comparison. In WWW. 247--256.
[35]
Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat Thalmann. 2013. Who, where, when and what: Discover spatio-temporal topics for twitter users. In KDD. 605--613.
[36]
Quan Yuan, Wei Zhang, Chao Zhang, Xinhe Geng, Gao Cong, and Jiawei Han. 2017. PRED: Periodic region detection for mobility modeling of social media users. In WSDM. 263--272.
[37]
Chao Zhang, Jiawei Han, Lidan Shou, Jiajun Lu, and Thomas F. La Porta. 2014. Splitter: Mining fine-grained sequential patterns in semantic trajectories. PVLDB 7, 9 (2014), 769--780.
[38]
Chao Zhang, Shan Jiang, Yucheng Chen, Yidan Sun, and Jiawei Han. 2015. Fast inbound top-K query for random walk with restart. In ECML/PKDD. 608--624.
[39]
Chao Zhang, Keyang Zhang, Quan Yuan, Haoruo Peng, Yu Zheng, Tim Hanratty, Shaowen Wang, and Jiawei Han. 2017. Regions, periods, activities: Uncovering urban dynamics via cross-modal representation learning. In WWW.
[40]
Chao Zhang, Keyang Zhang, Quan Yuan, Luming Zhang, Tim Hanratty, and Jiawei Han. 2016. GMove: Group-level mobility modeling using geo-tagged social media. In KDD. 1305--1314.
[41]
Chao Zhang, Guangyu Zhou, Quan Yuan, Honglei Zhuang, Yu Zheng, Lance M. Kaplan, Shaowen Wang, and Jiawei Han. 2016. GeoBurst: Real-time local event detection in geo-tagged tweet streams. In SIGIR. 513--522.
[42]
Liang Zhao, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2016. Multi-resolution spatial event forecasting in social media. In KDD.
[43]
Liang Zhao, Qian Sun, Jieping Ye, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2015. Multi-task learning for spatio-temporal event forecasting. In KDD. 1503--1512.
[44]
Liang Zhao, Jieping Ye, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2016. Hierarchical incomplete multi-source feature learning for spatiotemporal event forecasting. In KDD. 2085--2094.
[45]
Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: Concepts, methodologies, and applications. ACM TIST 5, 3 (2014), 38:1--38:55.

Cited By

View all
  • (2024)SemConvTree: Semantic Convolutional Quadtrees for Multi-Scale Event Detection in Smart CitySmart Cities10.3390/smartcities70501077:5(2763-2780)Online publication date: 28-Sep-2024
  • (2024)ContCommRTD: A Distributed Content-Based Misinformation-Aware Community Detection System for Real-Time Disaster ReportingIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.341723236:11(5811-5822)Online publication date: Nov-2024
  • (2024)conteNXt: A Graph-Based Approach to Assimilate Content and Context for Event Detection in OSNIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.337239911:4(5483-5495)Online publication date: Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 9, Issue 3
Regular Papers and Special Issue: Urban Intelligence
May 2018
370 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/3167125
  • Editor:
  • Yu Zheng
Issue’s Table of Contents
© 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 January 2018
Accepted: 01 March 2017
Revised: 01 February 2017
Received: 01 November 2016
Published in TIST Volume 9, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Event detection
  2. data stream
  3. local event
  4. location-based service
  5. social media
  6. spatiotemporal data mining

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)SemConvTree: Semantic Convolutional Quadtrees for Multi-Scale Event Detection in Smart CitySmart Cities10.3390/smartcities70501077:5(2763-2780)Online publication date: 28-Sep-2024
  • (2024)ContCommRTD: A Distributed Content-Based Misinformation-Aware Community Detection System for Real-Time Disaster ReportingIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.341723236:11(5811-5822)Online publication date: Nov-2024
  • (2024)conteNXt: A Graph-Based Approach to Assimilate Content and Context for Event Detection in OSNIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.337239911:4(5483-5495)Online publication date: Aug-2024
  • (2023)Real-Time Detection of COVID-19 Events From Twitter: A Spatial-Temporally Bursty-Aware MethodIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.316974210:2(656-672)Online publication date: Apr-2023
  • (2023)A Method Based on Entity Interaction Graph for Detecting Social Events Using GDELT2023 11th International Conference on Intelligent Computing and Wireless Optical Communications (ICWOC)10.1109/ICWOC57905.2023.10200684(67-75)Online publication date: 16-Jun-2023
  • (2022)Development of an intelligent recognition system for dynamic mid-air gesticulation of isolated alphanumeric keysExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118623210:COnline publication date: 30-Dec-2022
  • (2022)A multi-components approach to monitoring process structure and customer behaviour concept driftExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118533210:COnline publication date: 30-Dec-2022
  • (2022)A machine learning framework to predict kidney graft failure with class imbalance using Red Deer algorithmExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118515210:COnline publication date: 30-Dec-2022
  • (2022)Strategic response for ease of doing business using case-based reasoningExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118514210:COnline publication date: 30-Dec-2022
  • (2022)An end-to-end framework for information extraction from Italian resumesExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118487210:COnline publication date: 30-Dec-2022
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media