Abstract
Probabilistic topic models were successfully used to achieve the personalization task using query logs. Thus, both users and previously clicked results are considered when estimating probability distrubutions in order to answer users’queries. However, the proposed models are generally parametric and require to define in advance the number of topics. Moreover, they can not deal with new users. To overcome these limitations, we propose a model called the Hierarchical personalized Dirichlet Processes (HpDP) that personalizes search and allows to automatically learn the number of latent topics. It also addresses the challenging problem of predicting results for new users. We compare our model, with recent topic models and use them to rank online products by their likelihood given a particular user/query pair. Experiments performed on data from a real online products comparator show the effectiveness of our approach.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bennett, P.N., White, R.W., Chu, W., Dumais, S.T., Bailey, P., Borisyuk, F., Cui, X.: Modeling the impact of short- and long-term behavior on search personalization. In: Proceedings of the 35th International Conference on Research and Development in Information Retrieval, SIGIR (2012)
Blei, D., Ng, A., Jordan, M.I., Lafferty, J.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
Chirita, P.A., Nejdl, W., Paiu, R., Kohlschutter, C.: Using odp metadata to personalize search. In: Proceedings of the 28th Annual International Conference on Research and Development in Information Retrieval, SIGIR (2005)
Harvey, M., Crestani, F., Carman, M.: Building user profiles from topic models for personalised search. In: Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, CIKM (2013)
Harvey, M., Ruthven, I., Carman, M.: Improving social bookmark search using personalised latent variable language models. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, WSDM (2011)
Matthijs, N., Radlinski, F.: Personalizing web search using long term browsing history. In: Proceedings of the Fourth International Conference on Web Search and Data Mining, WSDM (2011)
Pretschner, A., Gauch, S.: Ontology based personalized search. In: Proceeding of the International Conference on Tools with Artificial Intelligence, ICTAI (1999)
Qiu, F., Cho, J.: Automatic identification of user interest for personalized search. In: Proceedings of the 15th International Conference on World Wide Web, WWW (2006)
Steyvers, M., Griffiths, T.: Probabilistic Topic Models. In: Landauer, T., Mcnamara, D., Dennis, S., Kintsch, W. (eds.) Latent Semantic Analysis: A Road to Meaning. Laurence Erlbaum (2007)
White, R.W., Bailey, P., Chen, L.: Predicting user interests from contextual information. In: Proceedings of the 32nd International Conference on Research and Development in Information Retrieval, SIGIR (2009)
Nguyen, T., Phung, D., Gupta, S., Venkatesh, S.: Extraction of Latent Patterns and Contexts from Social Honest Signals Using Hierarchical Dirichlet Processes. In: The IEEE International Conference on Pervasive Computing and Communications, PerCom (2013)
Sethuraman, J.: A Constructive Definition of Dirichlet Priors. Statistica Sinica, 4 (1994)
Teh, Y., Jordan, M., Beal, M., Blei, D.: Sharing Clusters Among Related Groups: Hierarchical Dirichlet Processes. In: Neural Information Processing Systems 17, NIPS (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Rochd, E.M., Quafafou, M. (2014). A Nonparametric Mixture Model for Personalizing Web Search. In: Blockeel, H., van Leeuwen, M., Vinciotti, V. (eds) Advances in Intelligent Data Analysis XIII. IDA 2014. Lecture Notes in Computer Science, vol 8819. Springer, Cham. https://doi.org/10.1007/978-3-319-12571-8_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-12571-8_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12570-1
Online ISBN: 978-3-319-12571-8
eBook Packages: Computer ScienceComputer Science (R0)