Abstract
Large-scale industrial recommendation systems (RS) usually confront computational problems due to the enormous corpus size. Hence, an efficient indexing structure is a practical solution to retrieve and recommend the most relevant items within a limited response time. The existing approaches that adopted embedding or tree-based index structures cannot handle the long-tail phenomenon. To address this issue, we propose a HIerarchical Tree-based model with variable-length layers (HIT) for recommendation systems. HIT consists of a hierarchical tree index structure and a user preference prediction model. It can fully exploit all the training data by dynamically adjusting the lengths of layers in its tree index structure, which can effectively alleviate the long-tail problem. To assess the models’ resistance against the long-tail problem, we further define two types of equilibrium under our index structure. To satisfy the equilibrium, we propose a corresponding hierarchical tree learning algorithm. Furthermore, for those items with a rare appearance in the training data, on which the learning algorithm would fail, we design a dedicated bandit layer to solve them. Extensive experiments on three large-scale real-world datasets show that HIT can significantly outperform the existing methods in terms of efficient recommendations on items with different frequencies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abdollahpouri, H.: Popularity bias in ranking and recommendation. In: AAAI/ACM Conference on AI, Ethic and Society (AIES), pp. 529–530 (2019)
Abdollahpouri, H., Burke, R., Mobasher, B.: Controlling popularity bias in learning-to-rank recommendation. In: 11th ACM Conference on Recommender Systems (RecSys), pp. 42–46 (2017)
Chen, Z., Xiao, R., Li, C., Ye, G., Sun, H., Deng, H.: ESAM: discriminative domain adaptation with non-displayed items to improve long-tail performance. In: The 43rd International ACM SIGIR conference on research and development in Information Retrieval (SIGIR), pp. 579–588 (2020)
Chu, W., et al.: A case study of behavior-driven conjoint analysis on yahoo! Front page today module. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 1097–1104 (2009)
Covington, P., Adams, J., Sargin, E.: Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM Conference on Recommender Systems (RecSys), pp. 191–198 (2016)
Gao, W., et al.: Learning an end-to-end structure for retrieval in large-scale recommendations. In: The 30th ACM International Conference on Information and Knowledge Management (CIKM) (2021)
Guo, H., Tang, R., Ye, Y., Li, Z., He, X.: DeepFM: a factorization-machine based neural network for CTR prediction. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pp. 1725–1731 (2017)
Harper, F.M., Konstan, J.A.: The movielens datasets: history and context. TiiS 5(4), 1–19 (2015)
He, M., Li, C., Hu, X., Chen, X., Wang, J.: Mitigating popularity bias in recommendation via counterfactual inference. In: Database Systems for Advanced Applications (DASFAA), pp. 377–388 (2022)
Huang, P.S., He, X., Gao, J., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management (CIKM), pp. 2333–2338 (2013)
Hurley, N., Zhang, M.: Novelty and diversity in top-N recommendation-analysis and evaluation. TOIT 10(4), 1–30 (2011)
Jegou, H., Douze, M., Schmid, C.: Product quantization for nearest neighbor search. TPAMI 33(1), 117–128 (2010)
Kamishima, T., Akaho, S., Asoh, H., Sakuma, J.: Correcting popularity bias by enhancing recommendation neutrality. In: The 8th ACM Conference on Recommender Systems (RecSys), Posters (2014)
Li, L., Chu, W., Langford, J., Schapire, R.E.: A contextual-bandit approach to personalized news article recommendation. In: Proceedings of the 19th International Conference on World Wide Web (WWW), pp. 661–670 (2010)
Ma, X., et al.: Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR), pp. 1137–1140 (2018)
Malkov, Y.A., Yashunin, D.A.: Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. TPAMI 42(4), 824–836 (2018)
Park, Y.J., Tuzhilin, A.: The long tail of recommender systems and how to leverage it. In: ACM Conference on Recommender Systems (RecSys), pp. 11–18 (2008)
Rendle, S.: Factorization machines. In: 2010 IEEE International Conference on Data Mining (ICDM), pp. 995–1000 (2010)
Shi, L.: Trading-off among accuracy, similarity, diversity, and long-tail: a graph-based recommendation approach. In: Proceedings of the 7th ACM Conference on Recommender Systems (RecSys), pp. 57–64 (2013)
Shrivastava, A., Li, P.: Asymmetric LSH (ALSH) for sublinear time maximum inner product search (MIPS). In: Advances in Neural Information Processing Systems (NeurIPS), pp. 2321–2329 (2014)
Wang, R., Fu, B., Fu, G., Wang, M.: Deep & cross network for ad click predictions. In: Proceedings of the ADKDD, pp. 1–7 (2017)
Xie, R., Qiu, Z., Rao, J., Liu, Y., Zhang, B., Lin, L.: Internal and contextual attention network for cold-start multi-channel matching in recommendation. In: Proceedings of the 29h International Joint Conference on Artificial Intelligence (IJCAI), pp. 2732–2738 (2020)
Yang, L., Cui, Y., Xuan, Y., Wang, C., Belongie, S., Estrin, D.: Unbiased offline recommender evaluation for missing-not-at-random implicit feedback. In: Proceedings of the 12th ACM Conference on Recommender Systems (RecSys), pp. 279–287 (2018)
Yin, H., Cui, B., Li, J., Yao, J., Chen, C.: Challenging the long tail recommendation. Proc. VLDB Endow. 5(9), 896–907 (2012)
Zheng, Y., Gao, C., Li, X., He, X., Li, Y., Jin, D.: Disentangling user interest and popularity bias for recommendation with causal embedding. arXiv preprint arXiv:2006.11011 (2020)
Zhou, G., et al.: Deep interest network for click-through rate prediction. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (SIGKDD), pp. 1059–1068 (2018)
Zhu, H., et al.: Joint optimization of tree-based index and deep model for recommender systems. In: Advances in Neural Information Processing Systems (NeurIPS), pp. 3971–3980 (2019)
Zhu, H., et al.: Learning tree-based deep model for recommender systems. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (SIGKDD), pp. 1079–1088 (2018)
Zhuo, J., Xu, Z., Dai, W., Zhu, H., Li, H., Xu, J., Gai, K.: Learning optimal tree models under beam search. In: International Conference on Machine Learning (ICML), pp. 11650–11659 (2020)
Acknowledgements
This work was supported in part by National Key R &D Program of China No. 2020YFB1707900, in part by China NSF grant No. 62132018, U2268204, 62272307 61902248, 61972254, 61972252, 62025204, 62072303, in part by Shanghai Science and Technology fund 20PJ1407900, in part by Alibaba Group through Alibaba Innovative Research Program, and in part by Tencent Rhino Bird Key Research Project. The opinions, findings, conclusions, and recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the funding agencies or the government.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, A. et al. (2023). HIT: Learning a Hierarchical Tree-Based Model with Variable-Length Layers for Recommendation Systems. In: Wang, X., et al. Database Systems for Advanced Applications. DASFAA 2023. Lecture Notes in Computer Science, vol 13944. Springer, Cham. https://doi.org/10.1007/978-3-031-30672-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-30672-3_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-30671-6
Online ISBN: 978-3-031-30672-3
eBook Packages: Computer ScienceComputer Science (R0)