Skip to main content
Log in

A survey of event detection techniques in online social networks

  • Review Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

The online social networks (OSNs) have become an important platform for detecting real-world event in recent years. These real-world events are detected by analyzing huge social-stream data available on different OSN platforms. Event detection has become significant because it contains substantial information which describes different scenarios during events or crisis. This information further helps to enable contextual decision making, regarding the event location, content and the temporal specifications. Several studies exist, which offers plethora of frameworks and tools for detecting and analyzing events used for applications like crisis management, monitoring and predicting events in different OSN platforms. In this paper, a survey is done for event detection techniques in OSN based on social text streams—newswire, web forums, emails, blogs and microblogs, for natural disasters, trending or emerging topics and public opinion-based events. The work done and the open problems are explicitly mentioned for each social stream. Further, this paper elucidates the list of event detection tools available for the researchers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Abel F, Hauff C, Houben GJ, Stronkman R, Tao K (2012) Twitcident: fighting fire with information from social web streams. In: Proceedings of the 21st international ACM conference companion on world wide web 305–308. doi: 10.1145/2187980.2188035

  • Agarwal N, Liu H (2008) Blogosphere: research issues, tools, and applications. ACM SIGKDD Explor Newsl 10(1):18–31. doi:10.1145/1412734.1412737

    Article  Google Scholar 

  • Agarwal N, Liu H, Tang L, Yu PS (2008) Identifying the influential bloggers in a community. In: Proceedings of the 2008 ACM international conference on web search and data mining (WSDM’08) 207–218. doi:10.1145/1341531.1341559

  • Aggarwal C, Subbian K (2012) Event detection in social streams. In: Proceedings of the 2012 SIAM international conference on data mining 12:624–635

  • Aggarwal CC, Zhai C (2012) Mining text data. Springer, Berlin

    Book  Google Scholar 

  • Ahn D (2006) The stages of event extraction. In: Proceedings of the workshop on annotating and reasoning about time and events. Association for Computational Linguistics 1–8

  • Allan J, Papka R, Lavrenko V (1998) On-line new event detection and tracking. In: Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval 37–45. doi:10.1145/290941.290954

  • Apache Software Foundation (2016) Apache Spark. http://spark.apache.org/. Accessed 1 Nov 2016

  • Atefeh F, Khreich W (2015) A survey of techniques for event detection in twitter. Comput Intell 31(1):132–164. doi:10.1111/coin.12017

    Article  MathSciNet  Google Scholar 

  • Balazinska M (2007) Event detection in mobile sensor networks. In: National Science Foundation (NSF) workshop on data management for mobile sensor networks 2007 (MobiSensors)

  • Bamrah NH, Satpute BS, Patil P (2014) Web forum crawling techniques. Int J Comput Appl 85:17

    Google Scholar 

  • Bansal N, Koudas N (2007) Blogscope: spatio-temporal Analysis of the blogosphere. In: Proceedings of the 16th international ACM conference on world wide web 1269–1270. doi:10.1145/1242572.1242802

  • Becker H, Naaman M, Gravano L (2011) Beyond trending topics: real-world event Identification on twitter. Int Conf Web Soc Media 11:438–441

    Google Scholar 

  • Benson E, Haghighi A, Barzilay R (2011) Event discovery in social media feeds. In: Proceedings of the 49th annual meeting of the association for computational linguistics. Human Language Technologies 1:389–398

  • Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  • Chen CC, Chen MC (2008) TSCAN: A novel method for topic summarization and content anatomy. In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval 579–586. doi:10.1145/1390334.1390433

  • Chen Y, Yang S, Cheng X (2009) Bursty topics extraction for web forums. In: Proceedings of the eleventh international ACM workshop on Web information and data management 55–58

  • Chen F, Du J, Qian W, Zhou A (2012) Topic detection over online forum. In: Web information systems and applications IEEE ninth conference (WISA) 235–240

  • Cheng V, Li CH (2007) Topic detection via participation using Markov logic network. In: Signal-image technologies and internet based system. Third international IEEE conference, 85–91

  • Chester TLS, Taylor M, Sandhu J, Forsting S, Ellis A, Stirling R, Galanis E (2011) Use of a web forum and an online questionnaire in the detection and investigation of an outbreak. Online journal of public healthinformatics3.1

  • Cisco VNI (2014) The zettabyte era: trends and analysis. Updated (29/05/2013). http://www.cisco.com/c/en/us/solutions/collateral/serviceprovider/visual-networking-index-vni/VNI_Hyperconnectivity_WP.html. Accessed Jan 2016

  • Cordeiro M (2012) Twitter event detection: combining wavelet analysis and topic inference summarization. In: Doctoral Symposium on Informatics Engineering (DSIE’2012)

  • Dasigi P, Hovy EH (2014) Modeling newswire events using neural networks for anomaly detection. In: 25th International Conference on Computational Linguistics (COLING 2014) 1414–1422

  • Deitrick W, Hu W (2013) Mutually enhancing community detection and sentiment analysis on twitter networks. J Data Anal Inf Process 1:19–29

    Google Scholar 

  • Dereszynski E, Dietterich T (2007) Probabilistic models for anomaly detection in remote sensor data streams. In: Proceedings of the 23rd conference on Uncertainty in Artificial Intelligence (UAI-2007) 75–82

  • Devi KN, Bhaskaran VM (2015) Online forums hotspot detection and analysis using aging theory. World Academy Sci Eng Technol Int J Comp Electr Autom Control Inf Eng 9(4):913–917

    Google Scholar 

  • Facebook (2016). www.facebook.com. Accessed December 2015

  • Fox D, Hightower J, Liao L, Schulz D, Borriello G (2003) Bayesian filtering for location estimation. IEEE Pervasive Comput 3:24–33

    Article  Google Scholar 

  • Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Annals Stat 29:1189–1232

  • Fung GPC, Yu JX, Yu PS, Lu H (2005) Parameter free bursty events detection in text streams. In: Proceedings of the 31st international conference on Very large data bases 181–192

  • Gill KE (2005) Blogging, RSS and the information landscape: a look at online news. In: WWW 2005 workshop on the weblogging ecosystem

  • Gionis A, Indyk P, Motwani R (1999) Similarity search in high dimensions via hashing. In: Very large data bases 1999 Sep 7 (VLDB) 99:518–529

  • Goller C, Kuchler A (1996) Learning task-dependent distributed representations by Backpropagation through structure. In: Neural Networks. 1996 IEEE International Conference 1:347–352

  • Grishman R, Westbrook D, Meyers A (2005) NYU’s english ACE 2005 system description. In: Proceedings of ACE 2005 Evaluation Workshop, Washington

  • Gu H, Xie X, Lv Q, Ruan Y, Shang L (2011) Etree: effective and efficient event modeling for real-time online social media networks. In: Web Intelligence and Intelligent Agent Technology (WI-IAT). 2011 IEEE/WIC/ACM International Conference 1:300–307

  • Guralnik V, Srivastava J (1999) Event detection from time series data. In: Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining 33–42

  • Halliday M, Hasan R (1976) Cohesion in english. Longman, London

    Google Scholar 

  • Hardy H, Kanchakouskaya V, Strzalkowski T (2006) Automatic event classification using surface text features. In: Proc. AAAI06 workshop on event extraction and synthesis 36–41

  • He Q, Chang K, Lim EP (2007) Analyzing feature trajectories for event detection. In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in Information Retrieval 207–214

  • He Q, Chang K, Lim EP, Zhang J (2007) Bursty feature representation for clustering text streams. In: SIAM International Conference of Data Mining 491–496

  • Hennig P, Berger P, Kurzynski D, Rantzsch H, Meinel C (2014) Efficient event detection for the blogosphere. In: Big Data and Cloud Computing (BdCloud). 2014 IEEE Fourth International Conference 408–415

  • Hong Y, Zhang J, Ma B, Yao J, Zhou G, Zhu Q (2011) Using cross-entity inference to improve event extraction. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies 1:1127–1136)

  • Huang J, Iwaihara M (2011) Realtime social sensing of support rate for microblogging. In: 2011 International Springer Conference of Database Systems for Advanced Applications 357–368

  • Ihler A, Hutchins J, Smyth P (2006) Adaptive event detection with time-varying poisson processes. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 207–216

  • Jadhav AS, Purohit H, Kapanipathi P, Anantharam P, Ranabahu AH, Nguyen V, Sheth AP (2010) Twitris 2.0: semantically empowered system for understanding perceptions from social data. http://corescholar.libraries.wright.edu/knoesis/252. Accessed Oct 2016

  • Java A, Song X, Finin T, Tseng B (2007) Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st ACM SNA-KDD 2007 workshop on Web mining and Social Network Analysis 56–65

  • Ji H, Grishman R (2008) Refining event extraction through cross-document inference. In: Association for Computational Linguistics (ACL) 254–262

  • Joachims T (1998) Text categorization with support vector machines: learning with many relevant features. In: Proceedings of the Tenth European Conference on Machine Learning 137–142

  • Jurgens D, Stevens K (2009) Event detection in blogs using temporal random indexing. In: Association for Computational Linguistics Proceedings of the Workshop on Events in Emerging Text Types 9–16

  • Kaplan AM, Haenlein M (2010) Users of the world, Unite! The challenges and opportunities of social media. Business Horizons 53(1):59–68

    Article  Google Scholar 

  • Kastner I, Monz C (2009) Automatic single-document key fact extraction from newswire articles. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics 415–423

  • Keogh EJ (2002) Exact indexing of dynamic time warping. Knowl Inf Syst 7(3):358–386

    Article  Google Scholar 

  • Li C, Sun A, Datta A (2012) Twevent: segment-based event detection from tweets. In: Proceedings of the 21st ACM international conference on information and knowledge management 155–164

  • CNN Library (2015) Mumbai terror attacks fast facts. http://edition.cnn.com/2013/09/18/world/asia/mumbai-terror-attacks/. Accessed Jan 2016

  • Kerman MC et al. (2009) Event detection challenges, methods, and applications in natural and artificial systems. In: Proceedings of 14th International Command and Control Research and Technology Symposium: “C2 and Agility”

  • Menon R. Gulati A (2010) Spatial—Temporal random indexing for event detection in newswire data, http://ankushgulati.weebly.com/uploads/6/0/3/6/6036818/final_report.pdf, Accessed Jan 2016

  • Khreich W, Granger E, Miri A, Sabourin R (2012) A survey of techniques for incremental learning of HMM parameters. Inf Sci 197:105–130

    Article  Google Scholar 

  • Kleinberg J (2006) Data stream management: processing high-speed data streams. Chapter temporal dynamics of on-line information streams. Springer, Berlin

    Google Scholar 

  • Kumar R, Novak J, Raghavan P, Tomkins A (2004) Structure and evolution of blogspace. Commun ACM 47(12):35–39

    Article  Google Scholar 

  • Kumar R, Novak J, Raghavan P, Tomkins A (2005) On the bursty evolution of blogspace. World Wide Web 8(2):159–178

    Article  Google Scholar 

  • Kumar S, Barbier G, Abbasi MA, Liu H (2011) TweetTracker: an analysis tool for humanitarian and disaster relief. In: International Conference on Web and Social Media (ICWSM)2011 Jul 5

  • Kumaran G, Allan J (2004) Text classification and named entities for new event detection. In: Proceedings of the 27th Annual international ACM SIGIR conference on Research and development in information retrieval 297–304

  • Lam W, Meng HML, Wong KL, Yen JCH (2001) Using contextual analysis for news event detection. Int J Intell Syst 16(4):525–546

    Article  MATH  Google Scholar 

  • Leskovec J, Backstrom L, Kleinberg J (2009) Meme-tracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge Discovery and Data Mining 497–506

  • Li R, Lei KH, Khadiwala R, Chang KCC (2012) Tedas: a twitter-based event detection and analysis system. In: Data engineering (icde), 2012 IEEE 28th international conference 1273–1276

  • Li Q, Ji H, Huang L (2013) Joint event extraction via structured prediction with global features. Assoc Comput Linguist 1:73–82

    Google Scholar 

  • Li J, Tai Z, Zhang R, Yu W, Liu L (2014) Online bursty event detection from microblog. In: Utility and Cloud Computing (UCC) 2014 IEEE/ACM 7th International Conference 865–870

  • Liao S, Grishman R (2010) Using document level cross-event inference to improve event extraction. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics 789–797

  • MacEachren AM, Jaiswal A, Robinson AC, Pezanowski S, Savelyev A, Mitra P, Blanford J (2011) Senseplace2: geotwitter analytics support for situational awareness. In: Visual Analytics Science and Technology (VAST), 2011 IEEE Conference 181–190

  • Madani A, Boussaid O, Zegour DE (2014) What’s happening: a survey of tweets event detection. In: Proceedings of the 3rd International Conference on Communication, Computation, Networks and Technologies INNOV2014 16–22

  • MarcSmith (2016) NodeXL: Network overview, discovery and exploration of excel. http://nodexl.codeplex.com/, Accessed Jan 2016

  • Marcus A, Bernstein MS, Badar O, Karger DR, Madden S, Miller RC (2011) Twitinfo: aggregating and visualizing microblogs for event exploration. In: Proceedings of the ACM SIGCHI conference on Human factors in computing systems 227–236

  • Margineantu D, Wong WK, Dash D (2010) Machine learning algorithms for event detection: A special issue of Machine Learning Journal. Springer 79: 257–259

  • Maslennikov M, Chua TS (2007) June) A Multi-Resolution Framework for Information Extraction from Free Text. Annual Meeting-Association for Computational Linguistics 45(1):592

    Google Scholar 

  • Massoudi K, Tsagkias M, De Rijke M, Weerkamp W (2011) Incorporating query expansion and quality indicators in searching microblog posts. In: European Springer Conference on Advances in information retrieval 362–367

  • Mathioudakis M, Koudas N (2010) Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data 1155–1158

  • McCreadie R., Macdonald C, Ounis I, Osborne M, Petrovic S (2013) Scalable distributed event detection for twitter. In: Big Data, 2013 IEEE International Conference 543–549

  • Metzler D, Bernstein Y, Croft WB, Moffat A, Zobel J (2005) The recap system for identifying information flow. In: Proceedings of the 28th Annual International ACM Special Interest Group of Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval 678–678

  • Metzler D, Cai C, Hovy E (2012) Structured event retrieval over microblog archives. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 646–655

  • Miwa M, Thompson P, Korkontzelos I, Ananiadou S (2014). Comparable study of event extraction in newswire and biomedical domains. In: COLING 2270–2279

  • Morstatter F, Kumar S, Liu H, Maciejewski R (2013) Understanding twitter data with tweetxplorer. In: Proceedings of the 19th ACM SIGKDD International conference on Knowledge discovery and data mining 1482–1485

  • Neill DB, Gorr WL (2007) Detecting and preventing emerging epidemics of crime. Advances in Disease Surveillance 4:13

    Google Scholar 

  • Neill DB, Wong WK (2009) Tutorial on Event Detection. KDD

  • Newswires (2015) https://www.newswire.com/, Accessed Feb 2016

  • Nurwidyantoro A, Winarko E (2013) Event detection in social media: a survey. In: ICT for Smart Society (ICISS). 2013 IEEE International Conference 1–5

  • Osborne M, Moran S, McCreadie R, Von Lunen A, Sykora M D, Cano E, Jackson T (2014) Real-time detection, tracking, and monitoring of automatically discovered events in social media. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations. ACL 2014 37–42

  • Long R, Wang H, Chen Y, Jin O, Yu Y (2011) Towards effective event detection, tracking and summarization on microblog data. In: International Springer Conference on Web-Age Information Management 652–663

  • Papadopoulos et al. (2015) Social sensor. Report. http://www.socialsensor.eu/images/wp1_evaluation_report.pdf. Accessed Jan 2016

  • Papka R, Allan J (1998) On-line new event detection using single pass clustering title2. Technical Report. University of Massachusetts

  • Patwardhan S, Riloff E (2009) A unified model of phrasal and sentential evidence for information extraction. In: Proceedings of the 2009. Conference on Empirical Methods in Natural Language Processing of Association for Computational Linguistics 1:151–160

  • Pereira Nunes B, Mera A, Kawase R, Fetahu B, Casanova MA, de Campos GHB (2014) A topic extraction process for online forums. In: Advanced Learning Technologies (ICALT). 2014 IEEE 14th International Conference 541–543

  • Petrović S, Osborne M, Lavrenko V (2010) Streaming first story detection with application to twitter. In: Human Language Technologies. The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics 181–189

  • Popescu AM, Pennacchiotti M (2010) Detecting controversial events from twitter. In: Proceedings of the 19th ACM international conference on Information and knowledge management 1873–1876

  • Popescu AM, Pennacchiotti M, Paranjpe D (2011) Extracting events and event descriptions from twitter. In: Proceedings of the 20th ACM International conference companion on World Wide Web 105–106

  • Purohit H, Sheth AP (2013) Twitris v3: from citizen sensing to analysis, coordination and action. In: International Conference of Weblogs and Social Media (ICWSM) 2013 Jul

  • Qi Y, Candan KS (2006) Cuts: curvature-based development pattern analysis and segmentation for blogs and other text streams. In: Proceedings of the 17th ACM Conference on Hypertext and Hypermedia 1–10

  • Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62(1–2):107–136

    Article  Google Scholar 

  • Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th ACM International conference on World Wide Web 851–860

  • Sankaranarayanan J, Samet H, Teitler BE, Lieberman MD, Sperling J (2009) Twitterstand: news in tweets. In: Proceedings of the 17th ACM Sigspatial International conference on Advances in Geographic information systems 42–51

  • Sayyadi H, Hurst M, Maykov A (2009) Event detection and tracking in social streams. In: Proceedings of the 3rd International Conference of Weblogs and Social Media (ICWSM) 17–20

  • Schubotz T, Krestel R (2015) Online temporal summarization of news events. In: 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology 1:409–412

  • Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905

    Article  Google Scholar 

  • Song X, Tseng BL, Lin CY, Sun MT (2006) Personalized recommendation driven by information flow. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 509–516

  • Stewart A, Smith M, Nejdl W (2011) A transfer approach to detecting disease reporting events in blog social media. In: Proceedings of the 22nd ACM conference on Hypertext and Hypermedia 271–280

  • Stuart TL, Sandhu J, Stirling R, Corder J, Ellis A (2010) Campylobacteriosis outbreak associated with ingestion of mud during a mountain bike race. Epidemiol Infect 138(12):1695–1703

    Article  Google Scholar 

  • Tork H. (2011). Event Detection. Thesis. Laboratory of Artificial Intelligence and Decision Support (LIAAD-INESC TEC)

  • Trendsmap (2015) http://trendsmap.com/, Accessed January 2016

  • Tseng BL, Tatemura J, Wu Y (2005) Tomographic clustering to visualize blog communities as mountain views. In: WWW 2005 Workshop on the weblogging ecosystem

  • Twitter (2016) www.twitter.com, Accessed December 2015

  • Ushahidi (2008) https://www.ushahidi.com/, Accessed January 2016

  • Wan X, Milios E, Kalyaniwalla N, Janssen J (2009) Link-based event detection in email communication networks. In: Proceedings of the 2009 ACM symposium on Applied Computing 1506–1510

  • Wasi S, Shaikh ZA, Shamsi J (2011) Contextual event information extractor for emails. Sindh University Research Journal (SURJ) (Science Series), 43(1(a))

  • Weng J, Lee BS (2011) Event detection in twitter. Int Conf Weblogs Soc Media (ICWSM) 11:401–408

    Google Scholar 

  • Wikipedia (2016) https://en.wikipedia.org/wiki/Wikipedia. Accessed 1November 2016

  • Xie Y (2011) Report on the public opinions and crisis management report. Social Science Literature Press, Beijing, 1–12 (in Chinese)

  • Xie W, Zhu F, Jiang J, Lim EP, Wang K (2013) Topicsketch: real-time bursty topic detection from twitter. In: 2013 IEEE 13th International Conference on Data Mining 837–846

  • Yang Y, Pierce T, Carbonell J (1998) A study of retrospective and on-line event detection. In: Proceedings of the 21st annual international ACM SIGIR Conference on Research and development in Information Retrieval 28–36

  • Yang Y, Carbonell JG, Brown RD, Pierce T, Archibald BT, Liu X (1999) Learning approaches for detecting and tracking news events. IEEE Intell Syst 14:32–43. doi:10.1109/5254.784083

    Article  Google Scholar 

  • Youtube (2016) www.youtube.com, Accessed December 2015

  • Zhao Q, Mitra P (2007). Event detection and visualization for social text streams. In: International Conference of Weblogs and Social Media. ICWSM

  • Zhao Q, Mitra P, Chen B (2007) Temporal and information flow based event detection from social text streams. In: Proceedings of the 22nd National Conference on Artificial Intelligence 2: 1501–1506

  • Zhao J, Wang X, Ma Z (2014) Towards events detection from microblog messages. Int J Hybrid Inf Technol 7(1):201–210

    Article  Google Scholar 

  • Zhi Li Wu, Chun Hung Li (2007) Topic detection in online discussion using non-negative matrix factorization. In: Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Workshops 272–275

  • Zhou X, Chen L (2014) Event Detection over twitter social media streams. VLDB J 23(3):381–400. doi:10.1007/s00778-013-0320-3

    Article  Google Scholar 

  • Zhou D, Chen L, He Y (2014) A simple bayesian modelling approach to event extraction from twitter. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland, USA 700–705

  • Zhu M, Hu W, Wu O (2008) Topic detection and tracking for threaded discussion communities. In: Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology 1:77–83

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ajey Kumar.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Goswami, A., Kumar, A. A survey of event detection techniques in online social networks. Soc. Netw. Anal. Min. 6, 107 (2016). https://doi.org/10.1007/s13278-016-0414-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-016-0414-1

Keywords

Navigation