skip to main content
research-article

Learning to Retrieve User Behaviors for Click-through Rate Estimation

Published:08 April 2023Publication History
Skip Abstract Section

Abstract

Click-through rate (CTR) estimation plays a crucial role in modern online personalization services. It is essential to capture users’ drifting interests by modeling sequential user behaviors to build an accurate CTR estimation model. However, as the users accumulate a large amount of behavioral data on the online platforms, the current CTR models have to truncate user behavior sequences and utilize the most recent behaviors, which leads to a problem that sequential patterns such as periodicity or long-term dependency are not contained in the recent behaviors but in far back history. However, it is non-trivial to model the entire user sequence by directly using it for two reasons. Firstly, the very long input sequences will make online inference time and system load infeasible. Secondly, the very long sequences contain much noise, thus making it difficult for CTR models to capture useful patterns effectively. To tackle this issue, we consider it from the input data perspective instead of designing more sophisticated yet complex models. As the entire user behavior sequence contains much noise, it is unnecessary to input the entire sequence. Instead, we could just retrieve only a small part of it as the input to the CTR model. In this article, we propose the User Behavior Retrieval (UBR) framework which aims at learning to retrieve the most informative user behaviors according to each CTR estimation request. Retrieving only a small set of behaviors could alleviate the two problems of utilizing very long sequences (i.e., inference efficiency and noisy input). The distinguishing property of UBR is that it supports arbitrary and learnable retrieval functions instead of utilizing a fixed pre-defined function, which is different from the current retrieval-based methods. Offline evaluations on three large-scale real-world datasets demonstrate the superiority and efficacy of the UBR framework. We further deploy UBR at the Huawei App Store, where it achieves 6.6% of eCPM gain in the online A/B test and now serves the main traffic in the Huawei App Store advertising scenario.

REFERENCES

  1. [1] Beutel Alex, Covington Paul, Jain Sagar, Xu Can, Li Jia, Gatto Vince, and Chi Ed H.. 2018. Latent cross: Making use of context in recurrent recommender systems. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Charikar Moses S.. 2002. Similarity estimation techniques from rounding algorithms. In Proceedings of the 34th Annual ACM Symposium on Theory of Computing. 380388.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Chen Qiwei, Pei Changhua, Lv Shanshan, Li Chao, Ge Junfeng, and Ou Wenwu. 2021. End-to-end user behavior retrieval in click-through RatePrediction model. CoRR abs/2108.04468 (2021).Google ScholarGoogle Scholar
  4. [4] Chen Qiwei, Zhao Huan, Li Wei, Huang Pipei, and Ou Wenwu. 2019. Behavior sequence transformer for e-commerce recommendation in alibaba. In Proceedings of the 1st DLP-KDD Workshop. 14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Chen Tong, Yin Hongzhi, Nguyen Quoc Viet Hung, Peng Wen-Chih, Li Xue, and Zhou Xiaofang. 2020. Sequence-aware factorization machines for temporal predictive analytics. In Proceedings of the 2020 IEEE 36th International Conference on Data Engineering. IEEE, 14051416.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Chen Xu, Xu Hongteng, Zhang Yongfeng, Tang Jiaxi, Cao Yixin, Qin Zheng, and Zha Hongyuan. 2018. Sequential recommendation with user memory networks. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Cheng Heng-Tze, Koc Levent, Harmsen Jeremiah, Shaked Tal, Chandra Tushar, Aradhye Hrishi, Anderson Glen, Corrado Greg, Chai Wei, Ispir Mustafa, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, and Hemal Shah. 2016. Wide and deep learning for recommender systems. In Proceedings of the DLRS@RecSys, ACM, 710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Choromanski Krzysztof, Likhosherstov Valerii, Dohan David, Song Xingyou, Gane Andreea, Sarlos Tamas, Hawkins Peter, Davis Jared, Mohiuddin Afroz, Kaiser Lukasz, David Belanger, Lucy Colwell, and Adrian Weller. 2020. Rethinking attention with performers. In Proceeding of the ICLR.Google ScholarGoogle Scholar
  9. [9] Ebesu Travis, Shen Bin, and Fang Yi. 2018. Collaborative memory network for recommendation systems. In Proceedings of the SIGIR.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Feng Yufei, Lv Fuyu, Shen Weichen, Wang Menghan, Sun Fei, Zhu Yu, and Yang Keping. 2019. Deep session interest network for click-through rate prediction. In Proceedings of the IJCAI.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Graves Alex, Wayne Greg, and Danihelka Ivo. 2014. Neural turing machines. CoRR abs/1410.5401 (2014).Google ScholarGoogle Scholar
  12. [12] Guo Huifeng, Tang Ruiming, Ye Yunming, Li Zhenguo, and He Xiuqiang. 2017. DeepFM: A factorization-machine based neural network for CTR prediction. In Proceeding of the IJCAI. 1725–1731Google ScholarGoogle Scholar
  13. [13] He Ruining, Fang Chen, Wang Zhaowen, and McAuley Julian. 2016. Vista: A visually, socially, and temporally-aware model for artistic recommendation. In Proceedings of the RecSys.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] He Ruining and McAuley Julian. 2016. Fusing similarity models with markov chains for sparse sequential recommendation. In Proceedings of the ICDM.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Hidasi Balázs and Karatzoglou Alexandros. 2018. Recurrent neural networks with top-k gains for session-based recommendations. In Proceedings of the CIKM.Google ScholarGoogle Scholar
  16. [16] Hidasi Balázs, Karatzoglou Alexandros, Baltrunas Linas, and Tikk Domonkos. 2016. Session-based recommendations with recurrent neural networks. ICLR.Google ScholarGoogle Scholar
  17. [17] Huang Tongwen, Zhang Zhiqi, and Zhang Junlin. 2019. FiBiNET: Combining feature importance and bilinear feature interaction for click-through rate prediction. In Proceedings of the 13th ACM Conference on Recommender Systems. 169177.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Jin Jiarui, Chen Xianyu, Zhang Weinan, Huang Junjie, Feng Ziming, and Yu Yong. 2022. Learn over past, evolve for future: Search-based time-aware recommendation with sequential behavior data. In Proceeding of the WWW, ACM, 2451–2461.Google ScholarGoogle Scholar
  19. [19] Jing How and Smola Alexander J.. 2017. Neural survival recommender. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] Juan Yuchin, Zhuang Yong, Chin Wei-Sheng, and Lin Chih-Jen. 2016. Field-aware factorization machines for CTR prediction. In Proceedings of the 10th ACM Conference on Recommender Systems. 4350.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Kang Wang-Cheng and McAuley Julian. 2018. Self-attentive sequential recommendation. In Proceedings of the ICDM.Google ScholarGoogle Scholar
  22. [22] Katharopoulos Angelos, Vyas Apoorv, Pappas Nikolaos, and Fleuret François. 2020. Transformers are rnns: Fast autoregressive transformers with linear attention. In Proceedings of the International Conference on Machine Learning. PMLR, 51565165.Google ScholarGoogle Scholar
  23. [23] Kitaev Nikita, Kaiser Łukasz, and Levskaya Anselm. 2020. Reformer: The Efficient Transformer. In Proceeding of the ICLR.Google ScholarGoogle Scholar
  24. [24] Koren Yehuda. 2009. Collaborative filtering with temporal dynamics. In Proceedings of the KDD.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Li Jiacheng, Wang Yujie, and McAuley Julian. 2020. Time interval aware self-attention for sequential recommendation. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Li Zekun, Cui Zeyu, Wu Shu, Zhang Xiaoyu, and Wang Liang. 2019. Fi-gnn: Modeling feature interactions via graph neural networks for ctr prediction. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 539548.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Lian Jianxun, Zhou Xiaohuan, Zhang Fuzheng, Chen Zhongxia, Xie Xing, and Sun Guangzhong. 2018. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 17541763.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Liu Bin, Tang Ruiming, Chen Yingzhi, Yu Jinkai, Guo Huifeng, and Zhang Yuzhou. 2019. Feature generation by convolutional neural network for click-through rate prediction. In Proceedings of the World Wide Web Conference. 11191129.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Liu Qiang, Wu Shu, Wang Diyi, Li Zhaokang, and Wang Liang. 2016. Context-aware sequential recommendation. In Proceedings of the ICDM.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Pasricha Rajiv and McAuley Julian. 2018. Translation-based factorization machines for sequential recommendation. In Proceedings of the 12th ACM Conference on Recommender Systems. 6371.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Pi Qi, Bian Weijie, Zhou Guorui, Zhu Xiaoqiang, and Gai Kun. 2019. Practice on long sequential user behavior modeling for click-through rate prediction. In Proceedings of the SIGKDD.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Qi Pi, Zhu Xiaoqiang, Zhou Guorui, Zhang Yujing, Wang Zhe, Ren Lejian, Fan Ying, and Gai Kun. 2020. Search-based user interest modeling with lifelong sequential behavior data for click-through rate prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Google ScholarGoogle Scholar
  33. [33] Qin Jiarui, Ren Kan, Fang Yuchen, Zhang Weinan, and Yu Yong. 2020. Sequential recommendation with dual side neighbor-based collaborative relation modeling. In Proceedings of the 13th ACM International Conference on Web Search and Data Mining. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Qin Jiarui, Zhang Weinan, Su Rong, Liu Zhirong, Liu Weiwen, Tang Ruiming, He Xiuqiang, and Yu Yong. 2021. Retrieval and interaction machine for tabular data prediction. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 13791389.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Qin Jiarui, Zhang W., Wu Xin, Jin Jiarui, Fang Yuchen, and Yu Y.. 2020. User behavior retrieval for click-through rate prediction. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Qu Yanru, Cai Han, Ren Kan, Zhang Weinan, Yu Yong, Wen Ying, and Wang Jun. 2016. Product-based neural networks for user response prediction. In Proceedings of the ICDM.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Qu Yanru, Fang Bohui, Zhang Weinan, Tang Ruiming, Niu Minzhe, Guo Huifeng, Yu Yong, and He Xiuqiang. 2018. Product-based neural networks for user response prediction over multi-field categorical data. ACM Transactions on Information Systems 37, 1 (2018), 135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Ren Kan, Qin Jiarui, Fang Yuchen, Zhang Weinan, Zheng Lei, Bian Weijie, Zhou Guorui, Xu Jian, Yu Yong, Zhu Xiaoqiang, and Kun Gai. 2019. Lifelong sequential modeling with personalized memorization for user response prediction. In Proceeding of the SIGIR, ACM, 565–574.Google ScholarGoogle Scholar
  39. [39] Ren Pengjie, Chen Zhumin, Li Jing, Ren Zhaochun, Ma Jun, and Rijke Maarten de. 2019. RepeatNet: A repeat aware neural recommendation machine for session-based recommendation. In Proceeding of the AAAI. AAAI Press, 4806–4813.Google ScholarGoogle Scholar
  40. [40] Rendle Steffen. 2010. Factorization machines. In Proceedings of the ICDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Rendle Steffen, Freudenthaler Christoph, Gantner Zeno, and Schmidt-Thieme Lars. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In UAI. AUAI Press, 452–461.Google ScholarGoogle Scholar
  42. [42] Rendle Steffen, Freudenthaler Christoph, and Schmidt-Thieme Lars. 2010. Factorizing personalized markov chains for next-basket recommendation. In Proceedings of the WWW.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Stephen E. Robertson, Steve Walker, Susan Jones, Micheline M. Hancock-Beaulieu, and Mike Gatford. 1995. Okapi at TREC-3. Nist Special Publication Sp 109, 109 (1995).Google ScholarGoogle Scholar
  44. [44] Sun Fei, Liu Jun, Wu Jian, Pei Changhua, Lin Xiao, Ou Wenwu, and Jiang Peng. 2019. BERT4Rec: Sequential recommendation with bidirectional encoder representations from transformer. In Proceedings of the CIKM. 14411450.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Ta Anh-Phuong. 2015. Factorization machines with follow-the-regularized-leader for CTR prediction in display advertising. In Proceedings of the 2015 IEEE International Conference on Big Data. IEEE.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Tang Jiaxi and Wang Ke. 2018. Personalized top-n sequential recommendation via convolutional sequence embedding. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Tay Yi, Dehghani Mostafa, Bahri Dara, and Metzler Donald. 2022. Efficient transformers: A survey. ACM Computing Surveys 55, 6 (2022), 1–28.Google ScholarGoogle Scholar
  48. [48] Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  49. [49] Villatel Kiewan, Smirnova Elena, Mary Jérémie, and Preux Philippe. 2018. Recurrent neural networks for long and short-term sequential recommendation. CoRR abs/1807.09142 (2018).Google ScholarGoogle Scholar
  50. [50] Wang Qinyong, Yin Hongzhi, Hu Zhiting, Lian Defu, Wang Hao, and Huang Zi. 2018. Neural memory streaming recommender networks with adversarial training. In Proceedings of the KDD.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Wang Ruoxi, Fu Bin, Fu Gang, and Wang Mingliang. 2017. Deep and cross network for ad click predictions. In Proceedings of the ADKDD’17. 17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Wang Xuanhui, Li Cheng, Golbandi Nadav, Bendersky Michael, and Najork Marc. 2018. The lambdaloss framework for ranking metric optimization. In Proceedings of the CIKM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Widmer Gerhard and Kubat Miroslav. 1996. Learning in the presence of concept drift and hidden contexts. Machine Learning 23, 1 (1996), 69–101.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Williams Ronald J.. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning 8, 3 (1992), 229–256.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Wu Chao-Yuan, Ahmed Amr, Beutel Alex, Smola Alexander J., and Jing How. 2017. Recurrent recommender networks. In Proceedings of the WSDM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. [56] Wu Qitian, Gao Yirui, Gao Xiaofeng, Weng Paul, and Chen Guihai. 2019. Dual sequential prediction models linking sequential recommendation and information dissemination. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 447457.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. [57] Wu Shu, Tang Yuyuan, Zhu Yanqiao, Wang Liang, Xie Xing, and Tan Tieniu. 2019. Session-based recommendation with graph neural networks. In Proceedings of the AAAI.Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. [58] Wu Yongji, Lian Defu, Gong Neil Zhenqiang, Yin Lu, Yin Mingyang, Zhou Jingren, and Yang Hongxia. 2021. Linear-time self attention with codeword histogram for efficient recommendation. In Proceedings of the Web Conference 2021. 12621273.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. [59] Xi Dongbo, Zhuang Fuzhen, Song Bowen, Zhu Yongchun, Chen Shuai, Hong Dan, Chen Tao, Gu Xi, and He Qing. 2020. Neural hierarchical factorization machines for user’s event sequence analysis. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 18931896.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. [60] Xiao Jun, Ye Hao, He Xiangnan, Zhang Hanwang, Wu Fei, and Chua Tat-Seng. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. In Proceeding of the IJCAI. ijcai.org, 3119–3125.Google ScholarGoogle Scholar
  61. [61] Xu Chengfeng, Zhao Pengpeng, Liu Yanchi, Sheng Victor S., Xu Jiajie, Zhuang Fuzhen, Fang Junhua, and Zhou Xiaofang. 2019. Graph contextualized self-attention network for session-based recommendation. In Proceedings of the IJCAI.Google ScholarGoogle ScholarCross RefCross Ref
  62. [62] Zhang Jiajun, Zhao Yang, Li Haoran, and Zong Chengqing. 2018. Attention with sparsity regularization for neural machine translation and summarization. IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, 3 (2018), 507518.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. [63] Zhang Weinan, Du Tianming, and Wang Jun. 2016. Deep learning over multi-field categorical data: A case study on user response prediction. In Proceedings of the ECIR.Google ScholarGoogle Scholar
  64. [64] Zhang Weinan, Qin Jiarui, Guo Wei, Tang Ruiming, and He Xiuqiang. 2021. Deep learning for click-through rate estimation. In Proceedings of the IJCAI.Google ScholarGoogle ScholarCross RefCross Ref
  65. [65] Zhou Guorui, Mou Na, Fan Ying, Pi Qi, Bian Weijie, Zhou Chang, Zhu Xiaoqiang, and Gai Kun. 2019. Deep interest evolution network for click-through rate prediction. In Proceedings of the AAAI.Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. [66] Zhou Guorui, Zhu Xiaoqiang, Song Chenru, Fan Ying, Zhu Han, Ma Xiao, Yan Yanghui, Jin Junqi, Li Han, and Gai Kun. 2018. Deep interest network for click-through rate prediction. In Proceedings of the KDD.Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. [67] Zhu Han, Li Xiang, Zhang Pengye, Li Guozheng, He Jie, Li Han, and Gai Kun. 2018. Learning tree-based deep model for recommender systems. In Proceedings of the KDD.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning to Retrieve User Behaviors for Click-through Rate Estimation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Information Systems
      ACM Transactions on Information Systems  Volume 41, Issue 4
      October 2023
      958 pages
      ISSN:1046-8188
      EISSN:1558-2868
      DOI:10.1145/3587261
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 April 2023
      • Online AM: 9 January 2023
      • Accepted: 9 December 2022
      • Revised: 30 August 2022
      • Received: 8 April 2022
      Published in tois Volume 41, Issue 4

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format