Skip to main content
Log in

A probabilistic framework of preference discovery from folksonomy corpus

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

The increasing availability of folksonomy data makes them vital for user profiling approaches to precisely detect user preferences and better understand user interests, so as to render some personalized recommendation or retrieval results. This paper presents a rigorous probabilistic framework to discover user preference from folksonomy data. Furthermore, we incorporate three models into the framework with the corresponding inference methods, expectation-maximization or Gibbs sampling algorithms. The user preference is expressed through topical conditional distributions. Moreover, to demonstrate the versatility of our framework, a recommendation method is introduced to show the possible usage of our framework and evaluate the applicability of the engaged models. The experimental results show that, with the help of the proposed framework, the user preference can be effectively discovered.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Sen S, Vig J, Riedl J. Tagommenders: connecting users to items through tags. In: Proceedings of the 18th International Conference on World Wide Web. 2009, 671–680

    Chapter  Google Scholar 

  2. Wetzker R, Zimmermann C, Bauckhage C, Albayrak S. I tag, you tag: translating tags for advanced user models. In: Proceedings of the 3rd ACM International Conference onWeb Search and DataMining. 2010, 71–80

    Google Scholar 

  3. Liang H, Xu Y, Li Y, Nayak R, Tao X. Connecting users and items with weighted tags for personalized item recommendations. In: Proceedings of the 21st ACM Conference on Hypertext and Hypermedia. 2010, 51–60

    Chapter  Google Scholar 

  4. Hofmann T. Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 1999, 50–57

    Google Scholar 

  5. Blei D M, Ng A Y, Jordan M I, Lafferty J. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003, 3: 993–1022

    MATH  Google Scholar 

  6. Rosen-Zvi M, Griffiths T L, Steyvers M, Smyth P. The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. 2004, 487–494

    Google Scholar 

  7. Halpin H, Robu V, Shepherd H. The complex dynamics of collaborative tagging. In: Proceedings of the 16th International Conference on World Wide Web. 2007, 211–220

    Chapter  Google Scholar 

  8. Markines B, Cattuto C, Menczer F, Benz D, Hotho A, Stumme G. Evaluating similarity measures for emergent semantics of social tagging. In: Proceedings of the 18th International Conference onWorld Wide Web. 2009, 641–650

    Google Scholar 

  9. Xu H, Wang J, Hua X S, Li S. Tag refinement by regularized LDA. In: Proceedings of the 17th ACM International Conference onMultimedia. 2009, 573–576

    Google Scholar 

  10. Yin D W, Xue Z Z, Hong L J, Davison B D. A probabilistic model for personalized tag prediction. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2010, 959–968

    Google Scholar 

  11. Belém F, Martins E, Pontes T, Almeida J, Gonçalves M. Associative tag recommendation exploiting multiple textual features. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2011, 1033–1042

    Google Scholar 

  12. Eda T, Yoshikawa M, Uchiyama T, Uchiyama T. The effectiveness of latent semantic analysis for building up a bottom-up taxonomy from folksonomy tags. World Wide Web, 2009, 12: 421–440

    Article  Google Scholar 

  13. Daud A, Li J Z, Zhou L Z, Zhang L, Ding Y, Muhammad F. Modeling ontology of folksonomy with latent semantics of tags. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. 2010, 516–523

    Chapter  Google Scholar 

  14. Sun A, Bhowmick S S. Image tag clarity: in search of visualrepresentative tags for social images. In: Proceedings of the 1st SIGMM Workshop on Social Media. 2009, 19–26

    Google Scholar 

  15. Lin Y, Lin H F, Jin S, Ye Z. Social annotation in query expansion: a machine learning approach. In: Proceedings of the 34th International ACMSIGIR Conference on Research and Development in Information Retrieval. 2011, 405–414

    Google Scholar 

  16. Xu S, Bao S, Fei B, Su Z, Yu Y. Exploring folksonomy for personalized search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2008, 155–162

    Google Scholar 

  17. Symeonidis P, Nanopoulos A, Manolopoulos Y. A unified framework for providing recommendations in social tagging systems based on ternary semantic analysis. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(2): 179–192

    Article  Google Scholar 

  18. Cai Y, Li Q. Personalized search by tag-based user profile and resource profile in collaborative tagging systems. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 2010, 969–978

    Google Scholar 

  19. Brafman R I, Domshlak C. Preference handling — an introductory tutorial. AI Magazine, 2009, 30(1): 58–86

    Article  Google Scholar 

  20. Fürnkranz J, Hüllermeier E. Preference Learning: An Introduction. New York: Springer, 2011

    Book  MATH  Google Scholar 

  21. Minka T, Lafferty J. Expectation-propagation for the generative aspect model. In: Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence. 2002, 352–359

    Google Scholar 

  22. Griffiths T L, Steyvers M. Finding scientific topics. In: Proceedings of the National Academy of Sciences. 2004, 5228–5235

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Basic Research program of China (2014CB340305), partly by the National Natural Science Foundation of China (Grant Nos. 61300070 and 61421003) and partly by the State Key Lab for Software Development Environment.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chunming Hu.

Additional information

Xiaohui Guo is a PhD student in the School of Computer Science and Engineering, Beihang University, China. His research interests include machine learning, data mining, and services oriented computing. His researches of machine learning are mainly focused on probabilistic graphical model, large-scale Bayesian model inference, and Bayesian sparse learning. Data mining researches include computational advertisements, recommender system, and geospatial temporal data analysis.

Chunming Hu is an associate professor at School of Computer Science and Engineering, Beihang University, China. He received his PhD degree from Beihang University in 2006. He was a post-doctoral research fellow in the distributed system group, Hong Kong University of Science and Technology, China in 2006–2007. His current research interests include the distributed systems, system virtualization, large scale data management and processing systems, and data mining applications.

Richong Zhang received his BS degree and MA degree from Jilin University, China in 2001 and 2004. In 2006, he received his MS degree from Dalhousie University, Canada. He received his PhD from the School of Information Technology and Engineering, University of Ottawa, Canada. He is currently an associate professor in the School of Computer Science and Engineering, Beihang University, China. His reasearch interests include recommender systems, knowledge graph and crowdsourcing.

Jinpeng Huai is a professor of the School of Computer Science and Engineering at Beihang University, China and a vice-minister of Ministry of Industry and Information Technology of the People’s Republic of China. He is an academician of Chinese Academy of Sciences. He used to serve on the Steering Committee for Advanced Computing Technology Subject for the National High-Tech Program (863) as Chief Scientist. His research interests include big data computing, distributed system, virtual computing, service-oriented computing, trustworthiness and security.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guo, X., Hu, C., Zhang, R. et al. A probabilistic framework of preference discovery from folksonomy corpus. Front. Comput. Sci. 11, 1075–1084 (2017). https://doi.org/10.1007/s11704-016-5132-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-016-5132-3

Keywords

Navigation