Evolutionary taxonomy construction from dynamic tag space

World Wide Web

Abstract

Collaborative tagging has become a common feature of current web sites, enabling ordinary users to annotate and describe online resources. The large collection of tags and their relationships forms a tag space, in which the popularity of tags and the correlations among them capture current social interests. Because tags are freely chosen keywords, they are difficult to organize. A taxonomy is a hierarchical concept structure that represents subsumption relationships, and automatically extracted taxonomies are a viable way to manage collaborative tags. However, tags change over time, so the extracted taxonomies must also incorporate the temporal evolution of tags. In this paper, we formalize the problem of evolutionary taxonomy generation over a large collection of tags. A sequence of taxonomies is generated to reflect the temporal changes of the underlying tag space. The proposed evolutionary taxonomy framework makes two novel contributions. First, we develop a context-aware edge selection algorithm for taxonomy extraction, which builds on the seminal association-rule mining algorithm. Second, we propose several strategies for evolutionary taxonomy fusion, which smooth the newly generated taxonomy with prior ones. We conduct an extensive performance study using a large real-life web page tagging dataset (i.e., Del.icio.us). The empirical results demonstrate the effectiveness and efficiency of the proposed approach.
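
To make the two ideas in the abstract concrete, the following minimal Python sketch illustrates the general technique under stated assumptions. It is not the algorithm of the paper: the abstract does not specify the context-aware edge selection or the fusion strategies. The sketch only shows how plain association-rule support and confidence can propose parent-to-child tag edges, and how edge weights from consecutive time windows might be blended with a simple linear smoothing. The function names `candidate_edges` and `smooth`, the thresholds `min_support` and `min_conf`, and the weight `alpha` are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def candidate_edges(posts, min_support=2, min_conf=0.6):
    """Propose (parent, child) edges from co-occurring tags.

    `posts` is an iterable of tag lists, one per annotated resource.
    An edge parent -> child is kept when the rule "child implies parent"
    has enough support and confidence, and the parent is the more
    frequent (more general) tag. Hypothetical helper, not the paper's method.
    """
    tag_count = Counter()
    pair_count = Counter()
    for tags in posts:
        unique = set(tags)
        tag_count.update(unique)
        pair_count.update(frozenset(p) for p in combinations(sorted(unique), 2))

    edges = {}
    for pair, support in pair_count.items():
        if support < min_support:
            continue
        a, b = tuple(pair)
        for parent, child in ((a, b), (b, a)):
            # Confidence of the rule child -> parent.
            conf = support / tag_count[child]
            if conf >= min_conf and tag_count[parent] > tag_count[child]:
                edges[(parent, child)] = conf
    return edges


def smooth(prev_edges, curr_edges, alpha=0.5):
    """Blend current edge weights with those of the previous time window.

    alpha = 1.0 ignores history; alpha = 0.0 freezes the old taxonomy.
    This linear blend is an assumed stand-in for a fusion strategy.
    """
    keys = set(prev_edges) | set(curr_edges)
    return {
        k: alpha * curr_edges.get(k, 0.0) + (1 - alpha) * prev_edges.get(k, 0.0)
        for k in keys
    }


if __name__ == "__main__":
    window_1 = [["programming", "python"], ["programming", "python"],
                ["programming", "java"], ["programming", "java"],
                ["programming"]]
    window_2 = [["programming", "python"], ["programming", "python"],
                ["programming", "rust"], ["programming", "rust"],
                ["programming", "java"]]
    e1 = candidate_edges(window_1)
    e2 = candidate_edges(window_2)
    for edge, weight in sorted(smooth(e1, e2).items()):
        print(edge, weight)
```

In this toy data the edge from `programming` to `java` falls below the support threshold in the second window, so smoothing lowers its weight instead of dropping it outright, which is the kind of temporal stability that fusion with prior taxonomies is meant to provide.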

Author information

Corresponding author

Correspondence to Bin Cui.

Additional information

The preliminary version of this paper appeared at WISE 2010.

About this article

Cite this article

Yao, J., Cui, B., Cong, G. et al. Evolutionary taxonomy construction from dynamic tag space. World Wide Web 15, 581–602 (2012). https://doi.org/10.1007/s11280-011-0150-4

  • DOI: https://doi.org/10.1007/s11280-011-0150-4
