Abstract
Social networks, which have become extremely popular in the twenty first century, contain a tremendous amount of user-generated content about real-world events. This user-generated content relays real-world events as they happen, and sometimes even ahead of the newswire. The goal of this work is to identify events from social streams. The proposed model utilizes sliding window-based statistical techniques to extract event candidates from social streams. Subsequently, the “Concept-based evolving graph sequences” approach is employed to verify information propagation trends of event candidates and to identify those events. The experimental results show the usefulness of our approach in identifying real-world events in social streams.

















Similar content being viewed by others
References
Allan J (2002) Topic detection and tracking: event-based information organization., The information retrieval seriesSpringer, Berlin
Alvanaki F, Sebastian M, Ramamritham K, Weikum G (2011) Enblogue: emergent topic detection in web 2.0 streams. In: Proceedings of the 2011 ACM SIGMOD international conference on management of data, ACM, p 1271–1274
Alvanaki F, Michel S, Ramamritham K, Weikum G (2012) See what’s enblogue: real-time emergent topic identification in social media. In: Proceedings of the 15th international conference on extending database technology, ACM, p 336–347
Bakshy E, Rosenn I, Marlow C, Adamic LA (2012) The role of social networks in information diffusion. In: Proceedings of World Wide Web, p 519–528
Becker H, Naaman M, Gravano L (2011) Beyond trending topics: real-world event identification on twitter. In: Proceedings of international AAAI conference on weblogs and social media, p 438–441
Bell M (2011) Sohaib athar’s tweets from the attack on osama bin laden. http://www.washingtonpost.com/blogs/blogpost/post/sohaib-athar-tweeted-the-attack-on-osama-bin-laden-without-knowing-it/2011/05/02/AF4c9xXF_blog.html. Accessed 10 Dec 2013
Blei DM, Lafferty JD (2006) Dynamic topic models. In: Proceedings of the 23rd international conference on machine learning, ACM, p 113–120
Broecheler M, Shakarian P, Subrahmanian V (2010) A scalable framework for modeling competitive diffusion in social networks. In: Proceedings of the IEEE second international conference on social computing (SocialCom), IEEE, p 295–302
Cataldi M, Di Caro L, Schifanella C (2010) Emerging topic detection on twitter based on temporal and social terms evaluation. In: Proceedings of the tenth international workshop on multimedia data mining, ACM, MDMKDD ’10, p 4:1–4:10
Diao Q, Jiang J, Zhu F, Lim EP (2012) Finding bursty topics from microblogs. In: Proceedings of the 50th annual meeting of the association for computational linguistics: long papers-Volume 1, Association for Computational Linguistics, p 536–544
Dou W, Wang X, Skau D, Ribarsky W, Zhou MX (2012) Leadline: interactive visual analysis of text data through event identification and exploration. In: Proceedings of IEEE conference on visual analytics science and technology (VAST), IEEE, p 93–102
Du Y, He Y, Tian Y, Chen Q, Lin L (2011) Microblog bursty topic detection based on user relationship. In: Proceedings of the 6th IEEE Joint international conference on information technology and artificial intelligence (ITAIC), IEEE, vol 1. p 260–263
Fung GPC, Yu JX, Yu PS, Lu H (2005) Parameter free bursty events detection in text streams. In: Proceedings of the 31st international conference on very large data bases, VLDB endowment, p 181–192
Gottron T, Radcke O, Pickhardt R (2013) On the temporal dynamics of influence on the social semantic web. In: Springer proceedings in complexity on semantic web and web science, Springer, p 75–87
Granovetter M (1973) The strength of weak ties. Am J Sociol 78(6):1360–1380
Guzman J, Poblete B (2013) On-line relevant anomaly detection in the twitter stream: an efficient bursty keyword detection model. In: Proceedings of the ACM SIGKDD workshop on outlier detection and description, ACM, p 31–39
Hong L, Ahmed A, Gurumurthy S, Smola AJ, Tsioutsiouliklis K (2012) Discovering geographical topics in the twitter stream. In: Proceedings of the 21st international conference on World Wide Web, ACM, p 769–778
Kumaran G, Allan J (2004) Text classification and named entities for new event detection. In: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, ACM, p 297–304
Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of World Wide Web
Kwan E, Hsu PL, Liang JH, Chen YS (2013) Event identification for social streams using keyword-based evolving graph sequences. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining, ACM, ASONAM ’13, p 450–457
Ma H, Wang B, Li N (2012) A novel online event analysis framework for micro-blog based on incremental topic modeling. In: Proceedings of the 13th ACIS international conference on software engineering, artificial intelligence, networking and parallel & distributed computing (SNPD), p 73–76
Mathioudakis M, Koudas N (2010) Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD international conference on management of data, ACM, p 1155–1158
Mihalcea R, Tarau P (2004) Textrank: bringing order into texts. In: Proceedings of EMNLP 2004, association for computational linguistics, p 404–411
Naaman M, Boase J, Lai CH (2010) Is it really about me?: message content in social awareness streams. In: Proceedings of the 2010 ACM conference on computer supported cooperative work, CSCW ’10, p 189–192
Ohsawa Y, Benson NE, Yachida M (1998) Keygraph: Automatic indexing by co-occurrence graph based on building construction metaphor. In: Proceedings IEEE international forum on research and technology advances in digital libraries (ADL), IEEE, p 12–18
Petrovic S, Osborne M, McCreadie R, Macdonald C, Ounis I, Shrimpton L (2013) Can twitter replace newswire for breaking news? In: Proceedings of the seventh international AAAI conference on weblogs and social media, The AAAI Press
Popescu AM, Pennacchiotti M (2010) Detecting controversial events from twitter. In: Proceedings of the 19th ACM international conference on information and knowledge management, p 1873–1876
Pratt SF, Giabbanelli PJ, Mercier JS (2013) Detecting unfolding crises with visual analytics and conceptual maps emerging phenomena and big data. In: Proceedings of the IEEE international conference onintelligence and security informatics (ISI), IEEE, p 200–205
Rapoport A (1953) Spread of information through a population with socio-structural bias: I. Assumption of transitivity. Bull Math Biophys 15(4):523–533
Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th international conference on World Wide Web, p 851–860
Sankaranarayanan J, Samet H, Teitler BE, Lieberman MD, Sperling J (2009) Twitterstand: News in tweets. In: Proceedings of the 17th ACM SIGSPATIAL international conference on advances in geographic information systems, p 42–51
Sayyadi H, Hurst M, Maykov A (2009) Event detection and tracking in social streams. In: Proceedings of international AAAI conference on weblogs and social media
Seo E, Mohapatra P, Abdelzaher T (2012) Identifying rumors and their sources in social networks. In: Proceedings of the SPIE conference on defense, security, and sensing, p 83891I
Shakarian P, Simari GI, Callahan D (2013) 29th internatioal conference on logic programming (ICLP-13) (tech.communication), Istanbul, Turkey, 24–28 Aug 2013
Shuyo N (2010) Language detection library for java. http://code.google.com/p/language-detection/. Accessed 10 Dec 2013
Twitter (2012) Twitter turns six. http://blog.twitter.com/2012/03/twitter-turns-six.html. Accessed 10 Dec 2013
Valkanas G, Gunopulos D (2013) How the live web feels about events. In: Proceedings of the 22nd ACM international conference on information & knowledge management, ACM, p 639–648
Wang X, Zhai C, Hu X, Sproat R (2007) Mining correlated bursty topic patterns from coordinated text streams. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, p 784–793
Wasserman T (2012) Twitter says it has 140 million users. http://mashable.com/2012/03/21/twitter-has-140-million-users/. Accessed 10 Dec 2013
Weng J, Lee BS (2011) Event detection in twitter. In: Proceedings of the international conference on weblogs and social media
Zacks JM, Tversky B (2001) Event structure in perception and conception. Psychol Bull 127:3
Author information
Authors and Affiliations
Corresponding author
Additional information
This article is part of the Social Network Analysis and Information Systems.
Rights and permissions
About this article
Cite this article
Chen, YS., Peng, YC., Liang, JH. et al. Concept-based event identification from social streams using evolving social graph sequences. Soc. Netw. Anal. Min. 5, 30 (2015). https://doi.org/10.1007/s13278-015-0269-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-015-0269-x