Abstract
The increasing availability of folksonomy data makes them vital for user profiling approaches to precisely detect user preferences and better understand user interests, so as to render some personalized recommendation or retrieval results. This paper presents a rigorous probabilistic framework to discover user preference from folksonomy data. Furthermore, we incorporate three models into the framework with the corresponding inference methods, expectation-maximization or Gibbs sampling algorithms. The user preference is expressed through topical conditional distributions. Moreover, to demonstrate the versatility of our framework, a recommendation method is introduced to show the possible usage of our framework and evaluate the applicability of the engaged models. The experimental results show that, with the help of the proposed framework, the user preference can be effectively discovered.
Similar content being viewed by others
References
Sen S, Vig J, Riedl J. Tagommenders: connecting users to items through tags. In: Proceedings of the 18th International Conference on World Wide Web. 2009, 671–680
Wetzker R, Zimmermann C, Bauckhage C, Albayrak S. I tag, you tag: translating tags for advanced user models. In: Proceedings of the 3rd ACM International Conference onWeb Search and DataMining. 2010, 71–80
Liang H, Xu Y, Li Y, Nayak R, Tao X. Connecting users and items with weighted tags for personalized item recommendations. In: Proceedings of the 21st ACM Conference on Hypertext and Hypermedia. 2010, 51–60
Hofmann T. Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 1999, 50–57
Blei D M, Ng A Y, Jordan M I, Lafferty J. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003, 3: 993–1022
Rosen-Zvi M, Griffiths T L, Steyvers M, Smyth P. The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. 2004, 487–494
Halpin H, Robu V, Shepherd H. The complex dynamics of collaborative tagging. In: Proceedings of the 16th International Conference on World Wide Web. 2007, 211–220
Markines B, Cattuto C, Menczer F, Benz D, Hotho A, Stumme G. Evaluating similarity measures for emergent semantics of social tagging. In: Proceedings of the 18th International Conference onWorld Wide Web. 2009, 641–650
Xu H, Wang J, Hua X S, Li S. Tag refinement by regularized LDA. In: Proceedings of the 17th ACM International Conference onMultimedia. 2009, 573–576
Yin D W, Xue Z Z, Hong L J, Davison B D. A probabilistic model for personalized tag prediction. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2010, 959–968
Belém F, Martins E, Pontes T, Almeida J, Gonçalves M. Associative tag recommendation exploiting multiple textual features. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2011, 1033–1042
Eda T, Yoshikawa M, Uchiyama T, Uchiyama T. The effectiveness of latent semantic analysis for building up a bottom-up taxonomy from folksonomy tags. World Wide Web, 2009, 12: 421–440
Daud A, Li J Z, Zhou L Z, Zhang L, Ding Y, Muhammad F. Modeling ontology of folksonomy with latent semantics of tags. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. 2010, 516–523
Sun A, Bhowmick S S. Image tag clarity: in search of visualrepresentative tags for social images. In: Proceedings of the 1st SIGMM Workshop on Social Media. 2009, 19–26
Lin Y, Lin H F, Jin S, Ye Z. Social annotation in query expansion: a machine learning approach. In: Proceedings of the 34th International ACMSIGIR Conference on Research and Development in Information Retrieval. 2011, 405–414
Xu S, Bao S, Fei B, Su Z, Yu Y. Exploring folksonomy for personalized search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2008, 155–162
Symeonidis P, Nanopoulos A, Manolopoulos Y. A unified framework for providing recommendations in social tagging systems based on ternary semantic analysis. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(2): 179–192
Cai Y, Li Q. Personalized search by tag-based user profile and resource profile in collaborative tagging systems. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 2010, 969–978
Brafman R I, Domshlak C. Preference handling — an introductory tutorial. AI Magazine, 2009, 30(1): 58–86
Fürnkranz J, Hüllermeier E. Preference Learning: An Introduction. New York: Springer, 2011
Minka T, Lafferty J. Expectation-propagation for the generative aspect model. In: Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence. 2002, 352–359
Griffiths T L, Steyvers M. Finding scientific topics. In: Proceedings of the National Academy of Sciences. 2004, 5228–5235
Acknowledgements
This work was supported by the National Basic Research program of China (2014CB340305), partly by the National Natural Science Foundation of China (Grant Nos. 61300070 and 61421003) and partly by the State Key Lab for Software Development Environment.
Author information
Authors and Affiliations
Corresponding author
Additional information
Xiaohui Guo is a PhD student in the School of Computer Science and Engineering, Beihang University, China. His research interests include machine learning, data mining, and services oriented computing. His researches of machine learning are mainly focused on probabilistic graphical model, large-scale Bayesian model inference, and Bayesian sparse learning. Data mining researches include computational advertisements, recommender system, and geospatial temporal data analysis.
Chunming Hu is an associate professor at School of Computer Science and Engineering, Beihang University, China. He received his PhD degree from Beihang University in 2006. He was a post-doctoral research fellow in the distributed system group, Hong Kong University of Science and Technology, China in 2006–2007. His current research interests include the distributed systems, system virtualization, large scale data management and processing systems, and data mining applications.
Richong Zhang received his BS degree and MA degree from Jilin University, China in 2001 and 2004. In 2006, he received his MS degree from Dalhousie University, Canada. He received his PhD from the School of Information Technology and Engineering, University of Ottawa, Canada. He is currently an associate professor in the School of Computer Science and Engineering, Beihang University, China. His reasearch interests include recommender systems, knowledge graph and crowdsourcing.
Jinpeng Huai is a professor of the School of Computer Science and Engineering at Beihang University, China and a vice-minister of Ministry of Industry and Information Technology of the People’s Republic of China. He is an academician of Chinese Academy of Sciences. He used to serve on the Steering Committee for Advanced Computing Technology Subject for the National High-Tech Program (863) as Chief Scientist. His research interests include big data computing, distributed system, virtual computing, service-oriented computing, trustworthiness and security.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Guo, X., Hu, C., Zhang, R. et al. A probabilistic framework of preference discovery from folksonomy corpus. Front. Comput. Sci. 11, 1075–1084 (2017). https://doi.org/10.1007/s11704-016-5132-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-016-5132-3