Abstract
Online social networks (OSNs) have become an important platform for detecting real-world events in recent years. These events are detected by analyzing the large volumes of social-stream data available on different OSN platforms. Event detection has become significant because this data contains substantial information describing different scenarios during events or crises. This information in turn supports contextual decision making regarding the event's location, content, and temporal specifications. Several studies exist that offer a plethora of frameworks and tools for detecting and analyzing events, used for applications such as crisis management, monitoring, and predicting events on different OSN platforms. This paper surveys event detection techniques in OSNs based on social text streams (newswire, web forums, emails, blogs, and microblogs) for natural disasters, trending or emerging topics, and public opinion-based events. The work done and the open problems are explicitly mentioned for each social stream. The paper also lists the event detection tools available to researchers.
Cite this article
Goswami, A., Kumar, A. A survey of event detection techniques in online social networks. Soc. Netw. Anal. Min. 6, 107 (2016). https://doi.org/10.1007/s13278-016-0414-1
DOI: https://doi.org/10.1007/s13278-016-0414-1