Abstract
Making use of visual perception in recommender systems is becoming more and more important. In the existing visual recommendation models (VRMs), the visual features of items are usually extracted based on the pre-trained convolutional neural network, and then combined with the non-visual features modeling to complete the prediction of users’ interest. There are two challenges in this field so far. First, most VRMs are developed around single-image items, and how to more effectively mine the visual features of multi-image items is seldom considered. Second, most models do not consider the distribution difference between the training datasets of the pre-trained model and the datasets for recommendation when extracting visual features based on the pre-training model, which may deepen the gap in the convolutional neural network’s understanding of image semantics on datasets. To address the above challenges, a Multi-image Visual Recommendation Model based on a Gated Neural Network (GMiRec) is proposed. It performs different forms of pooling operations on the visual features of multi-image items and uses the feed-forward neural network to realize the fusion of the multi-image visual information. In addition, a gated neural network taking item categories as input is designed to achieve supervised dimensionality reduction on the item visual features, which alleviates the problem of semantic gap. Experiments conducted on the Amazon datasets show that the proposed model is significantly improved compared with the existing models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang, S., Yao, L., Sun, A., et al.: Deep learning based recommender system: a survey and new perspectives. ACM Comput. Surv. 52(1), 1–38 (2019)
Linden, G., Smith, B., York, J.: Item-to-item collaborative filtering. IEEE Internet Comput. 7(1), 76–80 (2003)
Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 30–37 (2009)
Koren, Y.: Factor in the neighbors: scalable and accurate collaborative filtering. ACM Trans. Knowl. Discov. Data 4(1), 1–24 (2010)
Pan, R., Zhou, Y., Cao, B., et al.: One-class collaborative filtering. In: 8th IEEE International Conference on Data Mining, pp. 502–511. IEEE, Piscataway (2008)
He, X., Liao, L., Zhang, H., et al.: Neural collaborative filtering. In: 26th International Conference on World Wide Web Companion, pp.173–182. ACM, New York (2017)
He, X., Du, X., Wang, X., et al.: Outer product-based neural collaborative filtering. In: 27th International Joint Conference on Artificial Intelligence, pp. 2227–2233. ACM, New York (2018)
Truong, Q., Salah, A., Lauw, H.: Multi-modal recommender systems: hands-on exploration. In: 15th ACM Conference on Recommender Systems, pp. 834–837. ACM, New York (2021)
Liu, H., Lu, J., Yang, H., et al.: Category-specific CNN for visual-aware CTR prediction at JD. Com. In: 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2686–2696. ACM, New York (2020)
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: 13th IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778. IEEE, Piscataway (2016)
Mo, K., Liu, B., Xiao, L., Li, Y., Jiang, J.: Image feature learning for cold start problem in display advertising. In: 24th International Conference on Artificial Intelligence, pp. 3728–3734. IJCAI, Buenos Aires Argentina (2015)
Zhao, Z., Li, L., Zhang, B., et al.: What you look matters? Offline evaluation of advertising creatives for cold-start problem. In: 28th ACM International Conference on Information and Knowledge Management, pp. 2605–2613. ACM, New York (2019)
He, R., Mcauley, J.: VBPR: visual Bayesian personalized ranking from implicit feed-back. In: 16th AAAI Conference on Artificial Intelligence, pp. 144–150. AAAI, Menlo Park (2016)
Liu, Q., Wu, S., Wang, L.: Deepstyle: learning user preferences for visual recommendation. In: 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 841–844. ACM, New York (2017)
He, R., Mcauley, J.: Ups and downs: modeling the visual evolution of fashion trends with one-class collaborative filtering. In: 25th International Conference on World Wide Web, pp. 507–517. WWW, Switzerland (2016)
Yu, W., Zhang, H., He, X., et al.: Aesthetic-based clothing recommendation. In: 18th World Wide Web Conference, pp. 649–658. WWW, Switzerland (2018)
Yu, W., et al.: Visually aware recommendation with aesthetic features. VLDB J. 30(4), 495–513 (2021). https://doi.org/10.1007/s00778-021-00651-y
Zhang, F., Yuan, N., Lian, D., et al.: Collaborative knowledge base embedding for recommender systems. In: 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 353–362. ACM, New York (2016)
Chen, X., Zhao, P., Liu, Y., et al.: Exploiting visual contents in posters and still frames for movie recommendation. IEEE Access 6, 68874–68881 (2018)
Wu, C., Wu, F., Qi, T., et al.: MM-Rec: visiolinguistic model empowered multimodal news recommendation. In: 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2560–2564. ACM, New York (2022)
Lei, C., Liu, D., Li, W., et al.: Comparative deep learning of hybrid representations for image recommendations. In: 13th Conference on Computer Vision and Pattern Recognition, pp. 2545–2553. IEEE, Piscataway (2016)
Yilma, B., Leiva, L.: CuratorNet: visually-aware recommendation of art images. In: 23th Conference on Human Factors in Computing Systems, pp. 1–17. ACM, New York (2023)
Sandler, M., Howard, A., Zhu, M., et al.: MobileNetV2: inverted residuals and linear bottlenecks. In: 15th IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–1520. IEEE, Piscataway (2018)
Boureau, Y., Ponce, J., LeCun, Y.: A theoretical analysis of feature pooling in visual recognition, pp. 111–118. Omnipress, United States (2010)
Hein, A.: Identification and bridging of semantic gaps in the context of multi-domain engineering. In: 2010 Forum on Philosophy, Engineering & Technology, pp. 57–58 (2010)
Rendle, S., Feudenthaler, C., Ganther, Z., et al.: BPR: Bayesian personalized ranking from implicit feedback. In: 25th Conference on Uncertainty in Artificial Intelligence, pp. 452–461. ACM, New York (2009)
Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 188–197. ACL, Stroudsburg (2019)
Jonathan, O.: Product recommendation based on visual similarity. https://www.kaggle.com/code/jonathanoheix/product-recommendation-based-on-visual-similarity. Accessed 1 Feb 2023
Niu, W., Ccaverlee, J., Lu, H.: Neural personalized ranking for image recommendation. In: 8th ACM International Conference on Web Search and Data Mining. ACM, New York (2018)
Microsoft. Neural Network Intelligence. http://github.com/Microsoft/nni. Accessed 1 Feb 2023
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Acknowledgement
This work was supported by the National Natural Science Foundation of China (Nos. 62077038, 61672405, 62176196 and 62271374).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mu, C., Tang, X., Luo, J., Liu, Y. (2023). GMiRec: A Multi-image Visual Recommendation Model Based on a Gated Neural Network. In: Jin, Z., Jiang, Y., Buchmann, R.A., Bi, Y., Ghiran, AM., Ma, W. (eds) Knowledge Science, Engineering and Management. KSEM 2023. Lecture Notes in Computer Science(), vol 14118. Springer, Cham. https://doi.org/10.1007/978-3-031-40286-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-40286-9_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40285-2
Online ISBN: 978-3-031-40286-9
eBook Packages: Computer ScienceComputer Science (R0)