DOI: 10.1145/3106426.3109037
research-article

Improving click-through rate prediction accuracy in online advertising by transfer learning

Published: 23 August 2017

ABSTRACT

As the main revenue source of Internet companies, online advertising is a significant topic in which click-through rate (CTR) prediction plays a central role. An online advertising system often hosts many advertising products. Because of competition in the bidding mechanism, some products collect plenty of data for training a CTR prediction model, while others lack high-quality data. Accurate CTR prediction, however, requires a large amount of data, so it is necessary to transfer knowledge from a large product (the source) to a small product (the target). We propose a transfer learning method that iteratively updates the data weights to selectively combine source data with target data for training. To process huge volumes of advertising data efficiently, we design a sampling strategy based on gradient information and implement the algorithm on a MapReduce-like machine learning framework. Experiments on real advertising datasets show that our approach improves CTR prediction accuracy compared with the supervised learning method.
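
The abstract gives no algorithmic detail, so the following is only a minimal sketch of what iterative instance reweighting and gradient-based sampling could look like. It assumes logistic regression as the CTR model and a boosting-style multiplicative weight update. Every name here (`reweight_transfer`, `gradient_sample`, the decay factor `beta`, the number of rounds) is hypothetical and not taken from the paper, and the distributed MapReduce-like implementation is not modelled.

```python
import math
import random


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict(theta, x):
    # Predicted click probability for one feature vector.
    return sigmoid(sum(t * v for t, v in zip(theta, x)))


def fit_weighted_logreg(X, y, w, lr=0.5, epochs=200):
    """Weighted logistic regression fitted by batch gradient descent."""
    total = sum(w)
    theta = [0.0] * len(X[0])
    for _ in range(epochs):
        grad = [0.0] * len(theta)
        for xi, yi, wi in zip(X, y, w):
            r = wi / total * (predict(theta, xi) - yi)
            for j, v in enumerate(xi):
                grad[j] += r * v
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


def reweight_transfer(Xs, ys, Xt, yt, rounds=5, beta=0.5):
    """Assumed scheme: fit on source + target, then shrink the weights of
    source rows that the current model fits poorly. Target weights stay at 1."""
    ws = [1.0] * len(ys)
    wt = [1.0] * len(yt)
    theta = None
    for _ in range(rounds):
        theta = fit_weighted_logreg(Xs + Xt, ys + yt, ws + wt)
        # The error lies in [0, 1]; beta ** error barely changes well-fitted rows.
        ws = [w * beta ** abs(predict(theta, x) - y)
              for w, x, y in zip(ws, Xs, ys)]
    # theta comes from the last fit; ws includes the update made after it.
    return theta, ws


def gradient_sample(X, y, w, theta, k, rng):
    """Draw k row indices with probability proportional to the weighted
    per-row gradient norm (an assumed reading of 'gradient-based sampling')."""
    # For logistic loss the per-row gradient is (p - y) * x,
    # so its norm is |p - y| * ||x||.
    scores = [wi * abs(predict(theta, xi) - yi)
              * math.sqrt(sum(v * v for v in xi)) + 1e-12
              for xi, yi, wi in zip(X, y, w)]
    return rng.choices(range(len(y)), weights=scores, k=k)
```

In this sketch, source rows whose labels disagree with the target distribution accumulate larger errors and therefore lose weight over the rounds, which is one way to "selectively combine" the two datasets. Sampling by gradient norm concentrates computation on rows that still move the model. An unbiased training loop would also reweight each sampled row by the inverse of its sampling probability, which is omitted here for brevity.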


      • Published in

        WI '17: Proceedings of the International Conference on Web Intelligence
        August 2017
        1284 pages
        ISBN: 9781450349512
        DOI: 10.1145/3106426

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Acceptance Rates

        WI '17 Paper Acceptance Rate: 118 of 178 submissions, 66%
        Overall Acceptance Rate: 118 of 178 submissions, 66%
