Bursty event detection from collaborative tags

World Wide Web

Abstract

Collaborative tagging has emerged as a ubiquitous way to annotate and organize online resources. As descriptive keywords, large numbers of tags are created and associated with many types of resources, e.g., web pages, photos, videos and tweets. Users’ tagging actions over time reflect their changing interests, so monitoring and analyzing the temporal patterns of tags can provide important insights for tracing hot topics on the web. Existing work focuses on deriving temporal patterns for individual tags. However, tags assigned to online resources are strongly correlated. In this paper, we propose a new approach to detecting bursty tagging events, each of which captures the relations among a group of correlated tags that are either bursty themselves or associated with bursty tag co-occurrences. Such a bursty tagging event generally corresponds to a real-life event, and it profiles that event with more representative and comprehensible clues. The proposed approach has three stages. First, we exploit sliding time intervals to extract bursty features. Second, we adopt graph clustering techniques to group bursty features into meaningful bursty events, and we discuss the choice of similarity and granularity for event detection. Third, we use an automatically generated tag taxonomy to organize bursty events, facilitating burst-oriented navigation and analysis. An experimental study on a large real data set demonstrates the superiority of our new approach.
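The first two stages can be illustrated with a minimal sketch. The abstract does not give the burst score, the similarity measure, or the clustering algorithm, so everything concrete here is an assumption made for illustration: a feature (a tag or a tag pair) is flagged as bursty when its count in the current window exceeds the mean of the preceding windows by a multiple of their standard deviation, and connected components stand in for the graph clustering step. The function names and the `history` and `threshold` parameters are hypothetical, and the taxonomy stage is omitted.

```python
from collections import Counter, defaultdict
from itertools import combinations
from statistics import mean, pstdev


def window_counts(posts, num_windows):
    """Count single tags and co-occurring tag pairs per time window.

    posts: iterable of (window_index, set_of_tags); features are tuples.
    """
    counts = [Counter() for _ in range(num_windows)]
    for window, tags in posts:
        for tag in tags:
            counts[window][(tag,)] += 1
        for a, b in combinations(sorted(tags), 2):
            counts[window][(a, b)] += 1
    return counts


def bursty_features(counts, window, history=3, threshold=2.0):
    """Assumed burst test: count > mean + threshold * deviation of past windows."""
    past = counts[max(0, window - history):window]
    bursty = set()
    for feature, count in counts[window].items():
        previous = [p[feature] for p in past]
        if not previous:
            continue  # no history to compare against
        # Floor the deviation at 1.0 so flat histories do not flag tiny rises.
        spread = max(pstdev(previous), 1.0)
        if count > mean(previous) + threshold * spread:
            bursty.add(feature)
    return bursty


def group_events(features):
    """Group tags linked by bursty co-occurrence (connected components)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for feature in features:
        root = find(feature[0])
        for tag in feature[1:]:
            parent[find(tag)] = root
    groups = defaultdict(set)
    for tag in list(parent):
        groups[find(tag)].add(tag)
    return sorted(sorted(group) for group in groups.values())


# Toy stream: steady background tags, then a sudden pair in window 4.
posts = [(w, {"music"}) for w in range(5)] + [(w, {"news"}) for w in range(4)]
posts += [(4, {"earthquake", "japan"})] * 5
counts = window_counts(posts, num_windows=5)
events = group_events(bursty_features(counts, window=4))
print(events)
```

In this toy stream, the steady tags stay below the threshold while the sudden pair and its co-occurrence are flagged and merged into one event. A real system would replace the connected-components step with a weighted graph clustering method and tune the window size to the desired event granularity.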

Author information

Corresponding author

Correspondence to Bin Cui.

About this article

Cite this article

Yao, J., Cui, B., Huang, Y. et al. Bursty event detection from collaborative tags. World Wide Web 15, 171–195 (2012). https://doi.org/10.1007/s11280-011-0136-2

  • DOI: https://doi.org/10.1007/s11280-011-0136-2
