Abstract
Event detection on Twitter has attracted active research. Although existing work considers the semantic topic structure of documents for event detection, topic dynamics and semantic consistency remain under-investigated. In this paper, we study the problem of topical event detection in tweet streams. We define topical events as bursty occurrences of semantically consistent topics. We decompose the problem of topical event detection into two components: (1) We address the semantic incoherence that arises as topics evolve, and propose an improvement to topic modelling that filters out semantically inconsistent dynamic topics. (2) We propose to perform burst detection on the time series of dynamic topics to detect bursty occurrences. We apply the proposed techniques to a real-world application: detecting topical events in public transport tweets. Experiments demonstrate that our approach detects newsworthy events with a high success rate.
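To make the second component concrete, the following is a minimal sketch of burst detection on a single topic's time series. The abstract does not specify the burst detector used, so a simple moving-window threshold rule stands in for it here; the function name `detect_bursts`, the parameters `window` and `threshold`, and the sample counts are all hypothetical and chosen only for illustration.

```python
from statistics import mean, stdev


def detect_bursts(counts, window=5, threshold=2.0):
    """Return the indices whose count exceeds the recent baseline.

    A point is flagged when it is larger than the mean of the preceding
    `window` points plus `threshold` standard deviations. This is an
    illustrative stand-in, not the detector used in the paper.
    """
    bursts = []
    for i in range(window, len(counts)):
        history = counts[i - window:i]
        mu = mean(history)
        # Floor the spread at 1.0 so a flat history does not flag tiny changes.
        sigma = max(stdev(history), 1.0)
        if counts[i] > mu + threshold * sigma:
            bursts.append(i)
    return bursts


# Hypothetical per-interval tweet counts assigned to one dynamic topic.
topic_series = [4, 5, 3, 4, 5, 4, 30, 28, 5, 4]
print(detect_bursts(topic_series))
```

With these sample counts only the onset of the spike is flagged, because the spike itself inflates the baseline for the following interval. A detector meant to capture the full duration of a burst would need a baseline that excludes intervals already flagged as bursty.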
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Cui, L., Zhang, X., Zhou, X., Salim, F. (2016). Topical Event Detection on Twitter. In: Cheema, M., Zhang, W., Chang, L. (eds) Databases Theory and Applications. ADC 2016. Lecture Notes in Computer Science, vol 9877. Springer, Cham. https://doi.org/10.1007/978-3-319-46922-5_20
DOI: https://doi.org/10.1007/978-3-319-46922-5_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46921-8
Online ISBN: 978-3-319-46922-5
eBook Packages: Computer Science (R0)