ABSTRACT
In modern commercial search engines, the pay-per-click (PPC) advertising model is widely used in sponsored search. The search engines try to deliver ads which can produce greater click yields (the total number of clicks for the list of ads per impression). Therefore, predicting user clicks plays a critical role in sponsored search. The current ad-delivery strategy is a two-step approach which first predicts individual ad CTR for the given query and then selects the ads with higher predicted CTR. However, this strategy is naturally suboptimal and correlation between ads is often ignored under this strategy. The learning problem is focused on predicting individual performance rather than group performance which is the more important measurement.
In this paper, we study click yield measurement in sponsored search and focus on the problem---predicting group performance (click yields) in sponsored search. To tackle all challenges in this problem---depth effects, interactive influence, cold start and sparseness of ad textual information---we first investigate several effects and propose a novel framework that could directly predict group performance for lists of ads. Our extensive experiments on a large-scale real-world dataset from a commercial search engine show that we achieve significant improvement by solving the sponsored search problem from the new perspective. Our methods noticeably outperform existing state-of-the-art approaches.
- D. M. Blei and J. D. McAuliffe. Supervised topic models. In NIPS, 2007.Google ScholarDigital Library
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, January 2003. Google ScholarDigital Library
- G. Bouchard, D. Yin, and S. Guo. Convex collective matrix factorization. In Proceedings of AISTATS 2013.Google Scholar
- A. Broder and V. Josifovski. Introduction to computational advertising, 2011. Retrieved from http://www.stanford.edu/class/msande239/.Google Scholar
- B. Cao, D. Shen, K. Wang, and Q. Yang. Clickthrough log analysis by collaborative ranking. In AAAI, 2010.Google Scholar
- O. Chapelle and Y. Zhang. A dynamic bayesian network click model for web search ranking. In WWW '09. Google ScholarDigital Library
- D. Chen, W. Chen, H. Wang, Z. Chen, and Q. Yang. Beyond ten blue links: enabling user click modeling in federated web search. In Proceedings of WSDM '2, 2012. Google ScholarDigital Library
- Y. Chen and T. W. Yan. Position-normalized click prediction in search advertising. In Proceedings of KDD '12. Google ScholarDigital Library
- H. Cheng and E. Cantú-Paz. Personalized click prediction in sponsored search. In Proceedings of WSDM '10. Google ScholarDigital Library
- N. Craswell, O. Zoeter, M. Taylor, and B. Ramsey. An experimental comparison of click position-bias models. In WSDM '08. Google ScholarDigital Library
- H. Drucker, C. J. C. Burges, L. Kaufman, A. J. Smola, and V. Vapnik. Support vector regression machines. In NIPS, pages 155--161, 1996.Google Scholar
- J. Duchi, S. Shalev-Shwartz, Y. Singer, and T. Chandra. Efficient projections onto the l1-ball for learning in high dimensions. In Proceedings of ICML '08, 2008. Google ScholarDigital Library
- G. E. Dupret and B. Piwowarski. A user browsing model to predict search engine click data from past observations. In SIGIR '08. Google ScholarDigital Library
- T. Graepel, J. Q. Candela, T. Borchert, and R. Herbrich. Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine. In ICML '10.Google Scholar
- F. Guo, C. Liu, A. Kannan, T. Minka, M. Taylor, Y.-M. Wang, and C. Faloutsos. Click chain model in web search. In WWW '09. Google ScholarDigital Library
- F. Guo, C. Liu, and Y. M. Wang. Efficient multiple-click models in web search. In Proceedings of WSDM '09, 2009. Google ScholarDigital Library
- A. K. Gupta and D. K. Nagar. Matrix Variate Distributions. Chapman & Hall, 2000.Google Scholar
- D. Hillard, S. Schroedl, E. Manavoglu, H. Raghavan, and C. Leggetter. Improving ad relevance in sponsored search. In Proceedings of WSDM '10 Google ScholarDigital Library
- T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1--2):177--196, January-February 2001. Google ScholarDigital Library
- T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting click through data as implicit feedback. In Proceedings of SIGIR '05, 2005. Google ScholarDigital Library
- Y. Koren. Factor in the neighbors: Scalable and accurate collaborative filtering. ACM Transactions on Knowledge Discovery from Data, 4(1):1--24, January 2010. Google ScholarDigital Library
- Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30--37, 2009. Google ScholarDigital Library
- C. Liu, F. Guo, and C. Faloutsos. Bbm: bayesian browsing model from petabyte-scale data. In KDD '09, 2009. Google ScholarDigital Library
- C. Liu, F. Guo, and C. Faloutsos. Bayesian browsing model: Exact inference of document relevancfe from petabyte-scale data. ACM Trans. Knowl. Discov. Data, 4(4):19:1--19:26, Oct. 2010. Google ScholarDigital Library
- T.-Y. Liu. Learning to rank for information retrieval. Found. Trends Inf. Retr., 3(3):225--331, Mar. 2009. Google ScholarDigital Library
- A. K. Menon, K.-P. Chitrapura, S. Garg, D. Agarwal, and N. Kota. Response prediction using collaborative filtering with hierarchies and side-information. In KDD '11. Google ScholarDigital Library
- T. Minka, J. Winn, J. Guiver, and A. Kannan. A click through model - sample code. http://research.microsoft.com/en-us/um/cambridge/projects/infernet/docs/Click%20through%20model%20sample.aspx, 2009.Google Scholar
- T. Moon, A. Smola, Y. Chang, and Z. Zheng. Intervalrank: isotonic regression with listwise and pairwise constraints. In WSDM '10. Google ScholarDigital Library
- S. Rendle. Factorization machines with libFM. ACM Trans. Intell. Syst. Technol., 3(3):57:1--57:22, May 2012. Google ScholarDigital Library
- M. Richardson, E. Dominowska, and R. Ragno. Predicting clicks: estimating the click-through rate for new ads. In WWW '07. Google ScholarDigital Library
- R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS 21, pages 1257--1264. 2008.Google Scholar
- S. Shen, B. Hu, W. Chen, and Q. Yang. Personalized click model through collaborative filtering. In WSDM '12. Google ScholarDigital Library
- R. Srikant, S. Basu, N. Wang, and D. Pregibon. User browsing models: relevance versus examination. In KDD '10. Google ScholarDigital Library
- N. Usunier, D. Buffoni, and P. Gallinari. Ranking with ordered weighted pairwise classification. In ICML '09. Google ScholarDigital Library
- C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD '11, 2011. Google ScholarDigital Library
- X. Wang and et al. Click-through prediction for sponsored search advertising with hybrid models. In KDDCUP 2012, 2012.Google Scholar
- J. Weston, S. Bengio, and N. Usunier. Large scale image annotation: learning to rank with joint word-image embeddings. Mach. Learn., 81(1):21--35, Oct. 2010. Google ScholarDigital Library
- J. Weston, C. Wang, R. Weiss, and A. Berenzweig. Latent collaborative retrieval. In ICML, 2012.Google Scholar
- C. Xiong, T. Wang, W. Ding, Y. Shen, and T.-Y. Liu. Relational click prediction for sponsored search. In Proceedings of WSDM '12, 2012. Google ScholarDigital Library
- W. Xu, E. Manavoglu, and E. Cantu-Paz. Temporal click model for sponsored search. In SIGIR '10, 2010. Google ScholarDigital Library
- D. Yin, S. Guo, B. Chidlovskii, B. D. Davison, C. Archambeau, and G. Bouchard. Connecting comments and tags: Improved modeling of social tagging systems. In Proceedings of WSDM '13. Google ScholarDigital Library
- W. V. Zhang and R. Jones. Comparing click logs and editorial labels for training query rewriting. In Query Log Analysis: Social And Technological Challenges. A workshop at WWW 2007.Google Scholar
- Y. Zhang, W. Chen, D. Wang, and Q. Yang. User-click modeling for understanding and predicting search-behavior. In KDD '11. Google ScholarDigital Library
- J. Zhu, A. Ahmed, and E. P. Xing. Medlda: maximum margin supervised topic models for regression and classification. In Proceedings of ICML '09, 2009. Google ScholarDigital Library
- J. Zhu and E. Xing. Sparse topical coding. In UAI '11, 2011.Google Scholar
- Z. A. Zhu, W. Chen, T. Minka, C. Zhu, and Z. Chen. A novel click model and its applications to online advertising. In WSDM '10, 2010. Google ScholarDigital Library
Index Terms
- Estimating ad group performance in sponsored search
Recommendations
Sponsored Search: Is Money a Motivator for Providing Relevant Results?
Analysis of data from a major metasearch engine reveals that sponsored-link click-through rates appear lower than previously reported. Combining sponsored and nonsponsored links in a single listing, while providing some benefits to users, does not ...
Online learning from click data for sponsored search
WWW '08: Proceedings of the 17th international conference on World Wide WebSponsored search is one of the enabling technologies for today's Web search engines. It corresponds to matching and showing ads related to the user query on the search engine results page. Users are likely to click on topically related ads and the ...
Investigating the relevance of sponsored results for web ecommerce queries
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalAre sponsored links, the primary business model for Web search engines, providing Web consumers with relevant results? This research addresses this issue by investigating the relevance of sponsored and non-sponsored links for ecommerce queries from the ...
Comments