Skip to main content
Log in

The Effectiveness of Latent Semantic Analysis for Building Up a Bottom-up Taxonomy from Folksonomy Tags

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

In this paper, we evaluate the effectiveness of a semantic smoothing technique to organize folksonomy tags. Folksonomy tags have no explicit relations and vary because they form uncontrolled vocabulary. We discriminates so-called subjective tags like “cool” and “fun” from folksonomy tags without any extra knowledge other than folksonomy triples and use the level of tag generalization to form the objective tags into a hierarchy. We verify that entropy of folksonomy tags is an effective measure for discriminating subjective folksonomy tags. Our hierarchical tag allocation method guarantees the number of children nodes and increases the number of available paths to a target node compared to an existing tree allocation method for folksonomy tags.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998)

    Article  Google Scholar 

  2. Brooks, C.H., Montanez, N.: Improved annotation of the blogosphere via autotagging and hierarchical clustering. In: Proc. International World Wide Web Conference (WWW), pp. 625–632. ACM, New York (2006)

    Chapter  Google Scholar 

  3. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1999)

    Article  Google Scholar 

  4. Dubinko, M., Kumar, R., Magnani, J., Novak, J., Raghavan, P., Tomkins, A.: Visualizing tags over time. In: Proc. International World Wide Web Conference, pp. 193–202. ACM, New York (2006)

    Chapter  Google Scholar 

  5. Eda, T., Uchiyama, T., Uchiyama, T., Yoshikawa, M.: Signaling emotion in tagclouds. In: Proc. International World Wide Web Conference (WWW), pp. 1199–1200. ACM, New York (2009)

    Chapter  Google Scholar 

  6. Eda, T., Yoshikawa, M., Yamamuro, M.: Locally expandable allocation of folksonomy tags in a directed acyclic graph. In: Proc. Web Information Systems Engineering (WISE), pp. 151–162 (2008)

  7. Golder, S., Huberman, B.A.: Usage patterns of collaborative tagging systems. J. Inf. Sci. 32, 198–208 (2006)

    Article  Google Scholar 

  8. Heymann, P., Garcia-Molina, H.: Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical report, Stanford InfoLab. (2006)

  9. Heymann, P., Koutrika, G., Garcia-Molina, H.: Can social bookmarking improve web search? In: Proc. International Conference on Web Search and Data Mining (WSDM), pp. 195–206 (2008)

  10. Hofmann, T.: Probabilistic latent semantic analysis. In: Proc. Uncertainty in Artificial Intelligence, pp. 289–296, Stockholm (1999)

  11. Khy, S., Ishikawa, Y., Kitagawa, H.: A novelty-based clustering method for on-line documents. World Wide Web 11(1), 1–37 (2008)

    Article  Google Scholar 

  12. Kolari, P., Java, A., Finin, T.: Characterizing the splogosphere. In: Proc. Workshop on the Weblogging Ecosystem, Edinburgh (2006)

  13. Lee, L.: On the effectiveness of the skew divergence for statistical language analysis. In: Proc. International Workshop on Artificial Intelligence and Statistics, pp. 65–77 (2001)

  14. Lin, C., Xue, G.-R., Zeng, H.-J., Yu, Y.: Using probabilistic latent semantic analysis for personalized web search. In: Proc. Asia Pacific Web Conference (APWeb). Lecture Notes in Computer Science, vol. 3399, pp. 707–717. Springer, New York (2005)

    Google Scholar 

  15. Lux, M., Granitzer, M., Kern, R.: Aspects of broad folksonomies. In: Proc. International Conference on Database and Expert Systems Applications (DEXA), pp. 283–287 (2007)

  16. Martin, H., Keane, M.T.: An assessment of tag presentation techniques. In: Proc. International World Wide Web Conference (WWW), pp. 1313–1314 (2007)

  17. Mathes, A.: Folksonomies—cooperative classification and communication through shared metadata. http://www.adammathes.com/academic/computer-mediated-communication/folksonomies.html (2004)

  18. Mika, P.: Ontologies are us: a unified model of social network and semantics. In: Proc. International Semantic Web Conference (ISWC), pp. 522–536 (2005)

  19. Niwa, S., Doi, T., Honiden, S.: Web page recommender system based on folksonomy mining. In: Proc. International Conference on Information Technology: New Generations, pp. 388–393 (2006)

  20. Ramamohanarao, K., Fan, H.: Patterns based classifiers. World Wide Web 10(1), 71–83 (2007)

    Article  Google Scholar 

  21. Rivadeneira, A.W., Gruen, D.M., Muller, M.J., Millen, D.R.: Getting our head in the clouds: towards evaluation studies of tagclouds. In: Proc. Conference on Human Factors in Computing Systems (CHI), pp. 995–998 (2007)

  22. Sabou, M., Gracia, J., Angeletou, S., dAquin1, M., Motta, E.: Evaluating the semantic web: a task-based approach. In: Proc. International SemanticWeb Conference and Asian Semantic Web Conference (ISWC/ASWC). Lecture Notes in Computer Science, vol. 4825/2008, pp. 423–427 (2007)

  23. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)

    Article  MATH  Google Scholar 

  24. Sen, S., Lam, S.K., Rashid, A.M., Cosley, D., Frankowski, D., Osterhouse, J., Harper, M.F., Riedl, J.: Tagging, communities, vocabulary, evolution. In: Conference on Computer Supported Cooperative Work (CSCW), pp. 181–190, New York (2006)

  25. Sinclair, J., Cardew-Hall, M.: The folksonomy tag cloud: when is it useful? J. Inf. Sci. 34, 15–29 (2007)

    Article  Google Scholar 

  26. Voss, J.: Tagging, folksonomy & co-renaissance of manual indexing? http://arxiv.org/abs/cs/0701072v2 (2007)

  27. Wu, X., Zhang, L., Yu, Y.: Exploring social annotations for the semantic web. In: Proc. International World Wide Web Conference (WWW), pp. 417–426 (2006)

  28. Yanbe, Y., Jatowt, A., Nakamura, S., Tanaka, K.: Can social bookmarking enhance search in the web? In: Proc. ACM/IEEE Joint Conference on Digital Libraries (JCDL), pp. 107–116 (2007)

  29. Yee, K.-P., Swearingen, K., Li, K., Hearst, M.: Faceted metadata for image search and browsing. In: Conference on Human Factors in Computing Systems (CHI), pp. 401–408. ACM, New York (2003)

    Chapter  Google Scholar 

  30. Yeung, C.-m.A., Gibbins, N., Shadbolt, N.: Tag meaning disambiguation through analysis of tripartite structure of folksonomies. In: Proc. International Conferences on Web Intelligence and Intelligent Agent Technology (WI/IAT), pp. 3–6 (2007)

  31. Zhou, D., Bian, J., Zheng, S., Zha, H., Giles, C.L.: Exploring social annotations for information retrieval. In: Proc. International World Wide Web Conference (WWW), pp. 715–724 (2008)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Takeharu Eda.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Eda, T., Yoshikawa, M., Uchiyama, T. et al. The Effectiveness of Latent Semantic Analysis for Building Up a Bottom-up Taxonomy from Folksonomy Tags. World Wide Web 12, 421–440 (2009). https://doi.org/10.1007/s11280-009-0069-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-009-0069-1

Keywords

Navigation