skip to main content
10.1145/3477495.3531829acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Clustering based Behavior Sampling with Long Sequential Data for CTR Prediction

Published: 07 July 2022 Publication History

Abstract

Click-through rate (CTR) prediction is fundamental in many industrial applications, such as online advertising and recommender systems. With the development of the online platforms, the sequential user behaviors grow rapidly, bringing us great opportunity to better understand user preferences.However, it is extremely challenging for existing sequential models to effectively utilize the entire behavior history of each user. First, there is a lot of noise in such long histories, which can seriously hurt the prediction performance. Second, feeding the long behavior sequence directly results in infeasible inference time and storage cost. In order to tackle these challenges, in this paper we propose a novel framework, which we name as User Behavior Clustering Sampling (UBCS). In UBCS, short sub-sequences will be obtained from the whole user history sequence with two cascaded modules: (i) Behavior Sampling module samples short sequences related to candidate items using a novel sampling method which takes relevance and temporal information into consideration; (ii) Item Clustering module clusters items into a small number of cluster centroids, mitigating the impact of noise and improving efficiency. Then, the sampled short sub-sequences will be fed into the CTR prediction module for efficient prediction. Moreover, we conduct a self-supervised consistency pre-training task to extract user persona preference and optimize the sampling module effectively. Experiments on real-world datasets demonstrate the superiority and efficiency of our proposed framework.

Supplementary Material

MP4 File (SIGIR22-sp1656.mp4)
With the development of the Internet, the sequential user behaviors grow rapidly, bringing us great opportunity to improve CTR prediction, which is fundamental in lots of industrial applications. To model with long sequential user behaviors accurately and efficiently, we present a focused study on the Click-through rate prediction task with long sequential data, design a novel framework UBCS, and conducted experiments on public Amazon dataset.

References

[1]
Heng Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, and Mustafa and Ispir. 2016. Wide & Deep Learning for Recommender Systems. ACM (2016).
[2]
Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. (2017).
[3]
K. He, H. Fan, Y. Wu, S. Xie, and R. Girshick. 2020. Momentum Contrast for Unsupervised Visual Representation Learning. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[4]
Min Hou, Chang Xu, Yang Liu, Weiqing Liu, Jiang Bian, Le Wu, Zhi Li, Enhong Chen, and Tie-Yan Liu. 2021. Stock Trend Prediction with Multi-granularity Data: A Contrastive Learning Approach with Adaptive Fusion. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 700--709.
[5]
Jianqiang Huang, Ke Hu, Qingtao Tang, Mingjian Chen, Yi Qi, Jia Cheng, and Jun Lei. 2021. Deep Position-wise Interaction Network for CTR Prediction. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1885--1889.
[6]
B. Jin, D. Lian, Z. Liu, Q. Liu, J. Ma, X. Xie, and E. Chen. 2020. Sampling-Decomposable Generative Adversarial Recommender. (2020).
[7]
Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive sequential recommendation. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE, 197--206.
[8]
F. Li, Z. Chen, P. Wang, Y. Ren, and X. Zhu. 2021. Graph Intention Network for Click-through Rate Prediction in Sponsored Search. (2021).
[9]
Jiacheng Li, Yujie Wang, and Julian McAuley. 2020. Time interval aware selfattention for sequential recommendation. In Proceedings of the 13th international conference on web search and data mining. 322--330.
[10]
Defu Lian, Qi Liu, and Enhong Chen. 2020. Personalized ranking with importance sampling. In Proceedings of The Web Conference 2020. 1093--1103.
[11]
Chen Ma, Peng Kang, and Xue Liu. 2019. Hierarchical gating networks for sequential recommendation. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 825--833.
[12]
Qi Pi, Weijie Bian, Guorui Zhou, Xiaoqiang Zhu, and Kun Gai. 2019. Practice on long sequential user behavior modeling for click-through rate prediction. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2671--2679.
[13]
Qi Pi, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren, Ying Fan, Xiaoqiang Zhu, and Kun Gai. 2020. Search-based user interest modeling with lifelong sequential behavior data for click-through rate prediction. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2685--2692.
[14]
Jiarui Qin, Weinan Zhang, Xin Wu, Jiarui Jin, Yuchen Fang, and Yong Yu. 2020. User behavior retrieval for click-through rate prediction. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2347--2356.
[15]
Kan Ren, Jiarui Qin, Yuchen Fang,Weinan Zhang, Lei Zheng,Weijie Bian, Guorui Zhou, Jian Xu, Yong Yu, Xiaoqiang Zhu, et al. 2019. Lifelong sequential modeling with personalized memorization for user response prediction. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 565--574.
[16]
Qiu Ruihong, Li Jingjing, Huang Zi, and Yin Hongzhi. 2019. Rethinking the Item Order in Session-based Recommendation with Graph Neural Networks. ACM (2019).
[17]
Weichen Shen. 2017. DeepCTR: Easy-to-use,Modular and Extendible package of deep-learning based CTR models. https://github.com/shenweichen/deepctr.
[18]
Hao Wang, Defu Lian, Hanghang Tong, Qi Liu, Zhenya Huang, and Enhong Chen. 2021. HyperSoRec: Exploiting Hyperbolic User and Item Representations with Multiple Aspects for Social-aware Recommendation. ACM Transactions on Information Systems (TOIS) (2021).
[19]
Likang Wu, Zhi Li, Hongke Zhao, Zhen Pan, Qi Liu, and Enhong Chen. 2020. Estimating early fundraising performance of innovations via graph-based market environment model. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 6396--6403.
[20]
Wenwen Ye, Shuaiqiang Wang, Xu Chen, Xuepeng Wang, Zheng Qin, and Dawei Yin. 2020. Time matters: Sequential recommendation with complex temporal information. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1459--1468.
[21]
Runlong Yu, Yuyang Ye, Qi Liu, Zihan Wang, Chunfeng Yang, Yucheng Hu, and Enhong Chen. 2021. Xcrossnet: Feature structure-oriented learning for clickthrough rate prediction. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 436--447.
[22]
Xu Yuan, Hongshen Chen, Yonghao Song, Xiaofang Zhao, Zhuoye Ding, Zhen He, and Bo Long. 2021. Improving Sequential Recommendation Consistency with Self-Supervised Imitation. arXiv preprint arXiv:2106.14031 (2021).
[23]
Kai Zhang, Hao Qian, Qing Cui, Qi Liu, Longfei Li, Jun Zhou, Jianhui Ma, and Enhong Chen. 2021. Multi-interactive attention network for fine-grained feature learning in ctr prediction. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 984--992.
[24]
Pu Zhao, Chuan Luo, Cheng Zhou, Bo Qiao, Jiale He, Liangjie Zhang, and Qingwei Lin. 2021. RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction. In SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval.
[25]
Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, and Kun Gai. 2019. Deep interest evolution network for click-through rate prediction. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 5941--5948.
[26]
Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, and Kun Gai. 2018. Deep interest network for click-through rate prediction. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 1059--1068.

Cited By

View all
  • (2024)TWIN V2: Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at KuaishouProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680030(4890-4897)Online publication date: 21-Oct-2024
  • (2024)Context-based Fast Recommendation Strategy for Long User Behavior Sequence in Meituan WaimaiCompanion Proceedings of the ACM on Web Conference 202410.1145/3589335.3648334(355-363)Online publication date: 13-May-2024
  • (2024)Macro Graph Neural Networks for Online Billion-Scale Recommender SystemsProceedings of the ACM Web Conference 202410.1145/3589334.3645517(3598-3608)Online publication date: 13-May-2024
  • Show More Cited By

Index Terms

  1. Clustering based Behavior Sampling with Long Sequential Data for CTR Prediction

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
      July 2022
      3569 pages
      ISBN:9781450387323
      DOI:10.1145/3477495
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 July 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. ctr prediction
      2. information retrieval
      3. long sequential user behavior modeling

      Qualifiers

      • Short-paper

      Funding Sources

      Conference

      SIGIR '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)103
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 31 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)TWIN V2: Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at KuaishouProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680030(4890-4897)Online publication date: 21-Oct-2024
      • (2024)Context-based Fast Recommendation Strategy for Long User Behavior Sequence in Meituan WaimaiCompanion Proceedings of the ACM on Web Conference 202410.1145/3589335.3648334(355-363)Online publication date: 13-May-2024
      • (2024)Macro Graph Neural Networks for Online Billion-Scale Recommender SystemsProceedings of the ACM Web Conference 202410.1145/3589334.3645517(3598-3608)Online publication date: 13-May-2024
      • (2024)Incomplete Data Meets Uncoupled Case: A Challenging Task of Multiview ClusteringIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.322474835:6(8097-8110)Online publication date: Jun-2024
      • (2024)MIFI: Combining Multi-Interest Activation and Implicit Feature Interaction for CTR PredictionsIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.331362211:2(2889-2900)Online publication date: Apr-2024
      • (2023)Interpretable User Retention Modeling in RecommendationProceedings of the 17th ACM Conference on Recommender Systems10.1145/3604915.3608818(702-708)Online publication date: 14-Sep-2023
      • (2023)A Knowledge Enhanced Hierarchical Fusion Network for CTR Prediction under Account Search Scenario in WeChatCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3584650(475-479)Online publication date: 30-Apr-2023
      • (2023)DMBIN: A Dual Multi-behavior Interest Network for Click-Through Rate Prediction via Contrastive LearningProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591669(1366-1375)Online publication date: 19-Jul-2023
      • (2023)Research on Intelligent Recommendation Technology for Complex Tasks2023 4th International Conference on Computer Engineering and Application (ICCEA)10.1109/ICCEA58433.2023.10135209(353-360)Online publication date: 7-Apr-2023

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media