Abstract
As multimedia platforms such as TikTok and YouTube become increasingly prevalent, there is a growing demand for effective recommendation techniques. However, current recommendation methods often rely on categorical identity features that cannot be shared across platforms, which makes fine-tuning models for new scenarios challenging. Content displayed on these platforms often contains multimedia information, leading to a mixture-of-modality (MoM) feedback scenario. In addition, building an effective recommender system (RS) on platforms with smaller data footprints is challenging. To address these challenges, we propose TransRec, a general-purpose model pre-trained on a large-scale recommendation dataset that learns directly from MoM feedback through end-to-end training. TransRec enables transfer learning across various scenarios without relying on shared users or items and can transfer knowledge across modalities, thereby expanding the range of recommendation tasks it can accomplish. We empirically study TransRec's transfer ability in four real-world recommendation settings from distinct platforms, examining the effect of scaling the source and target data sizes. Our results show that learning neural recommendation models from MoM feedback is a promising way to create general-purpose recommender systems. Additionally, we build an MoM dataset (https://github.com/jieWANGforwork/TransRec) for research.
J. Wang: This work was done while Jie Wang was a visiting scholar at Westlake University and an intern at the Platform and Content Group, Tencent.
Notes
- 1. During end-to-end training, we optimize only the top two blocks of the item encoder, as this performs similarly to optimizing all parameters but is more efficient.
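The partial fine-tuning described in note 1 can be illustrated with a minimal sketch. It is not the authors' implementation: the helper `select_trainable`, the 12-block encoder, and the `encoder.layer.<index>.` parameter naming pattern are all assumptions chosen for illustration. The sketch only decides which parameter names fall inside the top blocks of an item encoder.

```python
import re

# Assumed naming convention: block parameters look like "encoder.layer.<index>.<rest>".
# Both this pattern and the helper name are hypothetical, not taken from the paper.
BLOCK_PATTERN = re.compile(r"encoder\.layer\.(\d+)\.")


def select_trainable(param_names, num_blocks, top_k=2):
    """Return the parameter names that belong to the top_k highest-indexed blocks."""
    first_trainable = num_blocks - top_k
    trainable = []
    for name in param_names:
        match = BLOCK_PATTERN.search(name)
        # Parameters outside any block (e.g. embeddings) never match and stay frozen.
        if match and int(match.group(1)) >= first_trainable:
            trainable.append(name)
    return trainable


# Illustrative parameter names for an assumed 12-block item encoder.
names = [f"encoder.layer.{i}.attention.weight" for i in range(12)]
names.append("embeddings.word_embeddings.weight")

trainable = select_trainable(names, num_blocks=12, top_k=2)
print(trainable)
```

In a framework such as PyTorch, the names left out of the returned list would typically have `requires_grad` set to `False`, so that only the selected blocks receive gradient updates.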
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, J. et al. (2024). TransRec: Learning Transferable Recommendation from Mixture-of-Modality Feedback. In: Zhang, W., Tung, A., Zheng, Z., Yang, Z., Wang, X., Guo, H. (eds) Web and Big Data. APWeb-WAIM 2024. Lecture Notes in Computer Science, vol 14962. Springer, Singapore. https://doi.org/10.1007/978-981-97-7235-3_13
DOI: https://doi.org/10.1007/978-981-97-7235-3_13
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-7234-6
Online ISBN: 978-981-97-7235-3
eBook Packages: Computer Science, Computer Science (R0)