Skip to main content
Log in

Event detection over twitter social media streams

  • Regular Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

In recent years, microblogs have become an important source for reporting real-world events. A real-world occurrence reported in microblogs is also called a social event. Social events may hold critical materials that describe the situations during a crisis. In real applications, such as crisis management and decision making, monitoring the critical events over social streams will enable watch officers to analyze a whole situation that is a composite event, and make the right decision based on the detailed contexts such as what is happening, where an event is happening, and who are involved. Although there has been significant research effort on detecting a target event in social networks based on a single source, in crisis, we often want to analyze the composite events contributed by different social users. So far, the problem of integrating ambiguous views from different users is not well investigated. To address this issue, we propose a novel framework to detect composite social events over streams, which fully exploits the information of social data over multiple dimensions. Specifically, we first propose a graphical model called location-time constrained topic (LTT) to capture the content, time, and location of social messages. Using LTT, a social message is represented as a probability distribution over a set of topics by inference, and the similarity between two messages is measured by the distance between their distributions. Then, the events are identified by conducting efficient similarity joins over social media streams. To accelerate the similarity join, we also propose a variable dimensional extendible hash over social streams. We have conducted extensive experiments to prove the high effectiveness and efficiency of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Notes

  1. http://en.wikipedia.org/wiki/Beta_distribution.

References

  1. Allan, J., Papka, R., Lavrenko, V.: On-line new event detection and tracking. In: SIGIR, pp. 37–45 (1998)

  2. AlSumait, L., Barbará, D., Domeniconi, C.: On-line lda: adaptive topic models for mining text streams with applications to topic detection and tracking. In: ICDM, pp. 3–12 (2008)

  3. Beckmann, N., Kriegel, H.-P., Schneider, R., Seeger, B.: The r*-tree: an efficient and robust access method for points and rectangles. In: SIGMOD, pp. 322–331 (1990)

  4. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  5. Chang, Y.-L., Chien, J.-T.: Latent dirichlet learning for document summarization. In: ICASSP, pp. 1689–1692 (2009)

  6. Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: CIKM, pp. 759–768 (2010)

  7. Fiscus, J.G., Doddington, G.R.: Topic detection and tracking evaluation overview. In: Allan, J. (ed.) Topic detection and Tracking, pp. 17–31. Kluwer Academic Publishers, Norwell, USA (2002)

  8. Fung, G.P.C., Yu, J.X., Yu, P.S., Lu, H.: Parameter free bursty events detection in text streams. In: VLDB, pp. 181–192 (2005)

  9. Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: SIGMOD, pp. 47–57 (1984)

  10. http://en.wikipedia.org/wiki/kullback

  11. http://en.wikipedia.org/wiki/twitter

  12. Hofmann, T.: Probabilistic latent semantic indexing. In: SIGIR, pp. 50–57 (1999)

  13. Jagadish, H.V., Ooi, B.C., Tan, K.-L., Yu, C., Zhang, R.: iDistance: an adaptive b+-tree based indexing method for nearest neighbor search. TODS 30(2), 364–397 (2005)

    Article  Google Scholar 

  14. Lin, C.X., Mei, Q., Han, J., Jiang, Y., Danilevsky, M.: The joint inference of topic diffusion and evolution in social communities. In: ICDM, pp. 378–387 (2011)

  15. Lin, J., Snow, R., Morgan, W.: Smoothing techniques for adaptive online language models: topic tracking in tweet streams. In: KDD, pp. 422–429 (2011)

  16. Lin, S., Özsu, M.T., Oria, V., Ng, R.T.: An extendible hash for multi-precision similarity querying of image databases. In: VLDB, pp. 221–230 (2001)

  17. Liu, S., Zhou, M.X., Pan, S., Qian, W., Cai, W., Lian, X.: Interactive, topic-based visual text summarization and analysis. In: CIKM, pp. 543–552 (2009)

  18. Rattenbury, T., Good, N., Naaman, M.: Towards automatic extraction of event and place semantics from flickr tags. In: SIGIR, pp. 103–110 (2007)

  19. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: WWW, pp. 851–860 (2010)

  20. Sizov, S.: Geofolk: latent spatial semantics in web 2.0 social media. In: WSDM, pp. 281–290 (2010)

  21. Wan, X., Milios, E., Kalyaniwalla, N., Janssen, J.: Link-based event detection in email communication networks. In: SAC, pp. 1506–1510 (2009)

  22. Wang, J., Zhao, Z., Zhou, J., Wang, H., Cui, B., Qi, G.: Recommending flickr groups with social topic model. Inf. Retr. 15(3–4), 278–295 (2012)

    Article  Google Scholar 

  23. Wang, X., McCallum, A.: Topics over time: a non-markov continuous-time model of topical trends. In: KDD, pp. 424–433 (2006)

  24. Wang, Y., Sundaram, H., Xie, L.: Social event detection with interaction graph modeling. In: ACM Multimedia, pp. 865–868 (2012)

  25. Wei, X., Croft, W.B.: Lda-based document models for ad-hoc retrieval. In: SIGIR, pp. 178–185 (2006)

  26. White, R.W., Jose, J.M.: A study of topic similarity measures. In: SIGIR, pp. 520–521 (2004)

  27. Yang, Y., Pierce, T., Carbonell, J.G.: A study of retrospective and on-line event detection. In: SIGIR, pp. 28–36 (1998)

  28. Yao, J., Cui, B., Huang, Y., Jin, X.: Temporal and social context based burst detection from folksonomies. In: AAAI, pp. 1474–1479 (2010)

  29. Yao, J., Cui, B., Xue, Z., Liu, Q.: Provenance-based indexing support in micro-blog platforms. In: ICDE, pp. 558–569 (2012)

  30. Yin, H., Cui, B., Li, J., Yao, J., Chen, C.: Challenging the long tail recommendation. PVLDB 5(9), 896–907 (2012)

    Google Scholar 

  31. Yin, H., Cui, B., Lu, H., Huang, Y., Yao, J.: A unified model for stable and temporal topic detection from social media data. In: ICDE, pp. 618–629 (2013)

  32. Yin, J., Lampert, A., Cameron, M., Robinson, B., Power, R.: Using social media to enhance emergency situation awareness. IEEE Intell. Syst. 27(6), 52–59 (2012)

    Google Scholar 

  33. Yin, Z., Cao, L., Han, J., Zhai, C., Huang, T.S.: Geographical topic discovery and comparison. In: WWW, pp. 247–256 (2011)

  34. Yu, C., Ooi, B.C., Tan, K.-L., Jagadish, H.V.: Indexing the distance: an efficient method to knn processing. In: VLDB, pp. 421–430 (2001)

  35. Zhang, K., Zi, J., Wu, L.G.: New event detection based on indexing-tree and named entity. In: SIGIR, pp. 215–222 (2007)

  36. Zhao, Q., Mitra, P.: Event detection and visualization for social text streams. In: ICWSM (2007)

  37. Zhao, Q., Mitra, P., Chen, B.: Temporal and information flow based event detection from social text streams. In: AAAI, pp. 1501–1506 (2007)

  38. Zhou, X., Zhou, X., Chen, L., Bouguettaya, A.: Efficient subsequence matching over large video databases. VLDB J. 21(4), 489–508 (2012)

    Article  Google Scholar 

  39. Zhou, X., Zhou, X., Chen, L., Shu, Y., Bouguettaya, A., Taylor, J.A.: Adaptive subspace symbolization for content-based video detection. IEEE Trans. Knowl. Data Eng. 22(10), 1372–1387 (2010)

    Article  Google Scholar 

  40. Zunjarwad, A., Sundaram, H., Xie, L.: Contextual wisdom: social relations and correlations for multimedia event annotation. In: ACM Multimedia, pp. 615–624 (2007)

Download references

Acknowledgments

Funding for this work was supported by the Hong Kong RGC GRF Project No. 611411, National Grand Fundamental Research 973 Program of China under Grant 2012-CB316200, Huawei Noah’s Ark Lab under Project HWLB06-15C03212/13PN, HP IRP Project 2011, and Microsoft Research Asia Gift Grant.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiangmin Zhou.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, X., Chen, L. Event detection over twitter social media streams. The VLDB Journal 23, 381–400 (2014). https://doi.org/10.1007/s00778-013-0320-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-013-0320-3

Keywords

Navigation