skip to main content
research-article

CoRec: An Efficient Internet Behavior-based Recommendation Framework with Edge-cloud Collaboration on Deep Convolution Neural Networks

Authors Info & Claims
Published:20 December 2022Publication History
Skip Abstract Section

Abstract

Both accurate and fast mobile recommendation systems based on click behaviors analysis are crucial in e-business. Deep learning has achieved state-of-the-art accuracy and the traditional wisdom often hosts these computation-intensive models in powerful cloud centers. However, the cloud-only approaches put significant computational pressure on cloud servers and increase the latency in heavy-load scenarios. Moreover, existing work often adopts RNN structures to model behaviors that suffer from low processing speed for under-utilization of parallel devices such as GPUs. In this work, we propose an efficient internet behavior-based recommendation framework with edge-cloud collaboration on deep CNNs (CoRec) to improve both the accuracy and speed for mobile recommendation. A novel convolutional interest network (CIN) improves the accuracy by modeling the long- and short-term interests and accelerates the prediction through parallel-friendly convolutions. To further improve the serving throughput and latency, a novel device-cloud collaboration strategy reduces workloads by pre-computing and caching long-term interests in the cloud offline and real-time computation of short-term interests in devices. Extensive experiments on real-world datasets show that CoRec significantly outperforms the state-of-the-art methods in accuracy and has achieved at least an order of magnitude improvement in latency and throughput compared to cloud-only RNN-based approaches for long behaviors.

REFERENCES

  1. [1] Abadi Martín, Barham Paul, Chen Jianmin, Chen Zhifeng, Davis Andy, Dean Jeffrey, Devin Matthieu, Ghemawat Sanjay, Irving Geoffrey, Isard Michael, et al. 2016. Tensorflow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI16). 265283.Google ScholarGoogle Scholar
  2. [2] Bahdanau Dzmitry, Cho Kyunghyun, and Bengio Yoshua. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).Google ScholarGoogle Scholar
  3. [3] Bai Shaojie, Kolter J. Zico, and Koltun Vladlen. 2018. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018).Google ScholarGoogle Scholar
  4. [4] Chakrabarti Deepayan, Agarwal Deepak, and Josifovski Vanja. 2008. Contextual advertising by combining relevance with click feedback. In Proceedings of the 17th International Conference on World Wide Web. 417426.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Chang Jianxin, Gao Chen, Zheng Yu, Hui Yiqun, Niu Yanan, Song Yang, Jin Depeng, and Li Yong. 2021. Sequential recommendation with graph neural networks. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 378387.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Chapelle Olivier, Manavoglu Eren, and Rosales Romer. 2014. Simple and scalable response prediction for display advertising. ACM Trans. Intell. Syst. Technol. 5, 4 (2014), 134.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Chen Cen, Li Kenli, Ouyang Aijia, and Li Keqin. 2018. FlinkCL: An openCL-based in-memory computing architecture on heterogeneous CPU-GPU clusters for big data. IEEE Trans. Comput. 67, 12 (2018), 17651779.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Chen Cen, Li Kenli, Ouyang Aijia, Zeng Zeng, and Li Keqin. 2018. GFlink: An in-memory computing architecture on heterogeneous CPU-GPU clusters for big data. IEEE Trans. Parallel Distrib. Syst. 29, 6 (2018), 12751288.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Chen Cen, Li Kenli, Teo Sin G., Zou Xiaofeng, Li Keqin, and Zeng Zeng. 2020. Citywide traffic flow prediction based on multiple gated spatio-temporal convolutional neural networks. ACM Trans. Knowl. Discov. Data 14, 4 (2020), 123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Cheng Heng-Tze, Koc Levent, Harmsen Jeremiah, Shaked Tal, Chandra Tushar, Aradhye Hrishi, Anderson Glen, Corrado Greg, Chai Wei, Ispir Mustafa, et al. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. ACM, 710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Chung Junyoung, Gulcehre Caglar, Cho KyungHyun, and Bengio Yoshua. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).Google ScholarGoogle Scholar
  12. [12] Dauphin Yann N., Fan Angela, Auli Michael, and Grangier David. 2017. Language modeling with gated convolutional networks. In Proceedings of the 34th International Conference on Machine Learning. JMLR. org, 933941.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Dave Kushal S. and Varma Vasudeva. 2010. Learning the click-through rate for rare/new ads from similar ads. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 897898.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Fang Zhou, Hong Dezhi, and Gupta Rajesh K.. 2019. Serving deep neural networks at the cloud edge for vision applications on mobile platforms. In Proceedings of the 10th ACM Multimedia Systems Conference. 3647.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Feng Yufei, Lv Fuyu, Shen Weichen, Wang Menghan, Sun Fei, Zhu Yu, and Yang Keping. 2019. Deep session interest network for click-through rate prediction. arXiv preprint arXiv:1905.06482 (2019).Google ScholarGoogle Scholar
  16. [16] Gehring Jonas, Auli Michael, Grangier David, and Dauphin Yann N.. 2016. A convolutional encoder model for neural machine translation. arXiv preprint arXiv:1611.02344 (2016).Google ScholarGoogle Scholar
  17. [17] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770778.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] He Xinran, Pan Junfeng, Jin Ou, Xu Tianbing, Liu Bo, Xu Tao, Shi Yanxin, Atallah Antoine, Herbrich Ralf, Bowers Stuart, et al. 2014. Practical lessons from predicting clicks on ads at Facebook. In Proceedings of the 8th International Workshop on Data Mining for Online Advertising. 19.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Huang Gao, Liu Zhuang, Maaten Laurens Van Der, and Weinberger Kilian Q.. 2017. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 47004708.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Huang Yakun, Qiao Xiuquan, Ren Pei, Liu Ling, Pu Calton, Dustdar Schahram, and Chen Junliang. 2020. A lightweight collaborative deep neural network for the mobile web in edge cloud. IEEE Trans. Mob. Comput. 21, 7 (2020).Google ScholarGoogle Scholar
  21. [21] Huang Yakun, Qiao Xiuquan, Tang Jian, Ren Pei, Liu Ling, Pu Calton, and Chen Junliang. 2020. DeepAdapter: A collaborative deep learning framework for the mobile web using context-aware network pruning. In Proceedings of the IEEE Conference on Computer Communications. IEEE, 834843.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Kalchbrenner Nal, Espeholt Lasse, Simonyan Karen, Oord Aaron van den, Graves Alex, and Kavukcuoglu Koray. 2016. Neural machine translation in linear time. arXiv preprint arXiv:1610.10099 (2016).Google ScholarGoogle Scholar
  23. [23] Kang Yiping, Hauswald Johann, Gao Cao, Rovinski Austin, Mudge Trevor, Mars Jason, and Tang Lingjia. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Comput. Archit. News 45, 1 (2017), 615629.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Lai Guokun, Chang Wei Cheng, Yang Yiming, and Liu Hanxiao. 2017. Modeling long- and short-term temporal patterns with deep neural networks. In the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 95–104. .Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Lian Jianxun, Zhou Xiaohuan, Zhang Fuzheng, Chen Zhongxia, Xie Xing, and Sun Guangzhong. 2018. xDeepFM: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 17541763.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Lin Tsung-Yi, Dollár Piotr, Girshick Ross, He Kaiming, Hariharan Bharath, and Belongie Serge. 2017. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 21172125.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Liu Bin, Tang Ruiming, Chen Yingzhi, Yu Jinkai, Guo Huifeng, and Zhang Yuzhou. 2019. Feature generation by convolutional neural network for click-through rate prediction. In Proceedings of the World Wide Web Conference. ACM, 11191129.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Lu Wantong, Yu Yantao, Chang Yongzhe, Wang Zhen, Li Chenhui, and Yuan Bo. 2021. A dual input-aware factorization machine for CTR prediction. In Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence. 31393145.Google ScholarGoogle Scholar
  29. [29] Maaten Laurens van der and Hinton Geoffrey. 2008. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, Nov. (2008), 25792605.Google ScholarGoogle Scholar
  30. [30] Mao Jiachen, Chen Xiang, Nixon Kent W., Krieger Christopher, and Chen Yiran. 2017. MoDNN: Local distributed mobile computing system for deep neural network. In Proceedings of the Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 13961401.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Mao Jiachen, Yang Qing, Li Ang, Li Hai, and Chen Yiran. 2019. MobiEye: An efficient cloud-based video detection system for real-time mobile applications. In Proceedings of the 56th Annual Design Automation Conference. 16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] McMahan H. Brendan, Holt Gary, Sculley David, Young Michael, Ebner Dietmar, Grady Julian, Nie Lan, Phillips Todd, Davydov Eugene, Golovin Daniel, et al. 2013. Ad click prediction: A view from the trenches. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 12221230.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Mikolov Tomas, Sutskever Ilya, Chen Kai, Corrado Greg S., and Dean Jeff. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 31113119.Google ScholarGoogle Scholar
  34. [34] Oentaryo Richard J., Lim Ee-Peng, Low Jia-Wei, Lo David, and Finegold Michael. 2014. Predicting response in mobile advertising with hierarchical importance-aware factorization machine. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining. 123132.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Qu Yanru, Cai Han, Ren Kan, Zhang Weinan, Yu Yong, Wen Ying, and Wang Jun. 2016. Product-based neural networks for user response prediction. In Proceedings of the IEEE 16th International Conference on Data Mining (ICDM). IEEE, 11491154.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Quijano-Sánchez Lara, Cantador Iván, Cortés-Cediel María E., and Gil Olga. 2020. Recommender systems for smart cities. Inf. Syst. 92 (2020), 101545.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Ran Xukan, Chen Haolianz, Zhu Xiaodan, Liu Zhenming, and Chen Jiasi. 2018. DeepDecision: A mobile deep learning framework for edge video analytics. In Proceedings of the IEEE Conference on Computer Communications. IEEE, 14211429.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Rendle Steffen. 2010. Factorization machines. In Proceedings of the IEEE International Conference on Data Mining. IEEE, 9951000.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Rendle Steffen. 2012. Factorization machines with libFM. ACM Trans. Intell. Syst. Technol. 3, 3 (2012), 122.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Richardson Matthew, Dominowska Ewa, and Ragno Robert. 2007. Predicting clicks: Estimating the click-through rate for new ads. In Proceedings of the 16th International Conference on World Wide Web. ACM, 521530.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Sandler Mark, Howard Andrew, Zhu Menglong, Zhmoginov Andrey, and Chen Liang-Chieh. 2018. MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 45104520.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Teerapittayanon Surat, McDanel Bradley, and Kung Hsiang-Tsung. 2017. Distributed deep neural networks over the cloud, the edge and end devices. In Proceedings of the IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 328339.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Yan Ling, Li Wu-Jun, Xue Gui-Rong, and Han Dingyi. 2014. Coupled group lasso for web-scale CTR prediction in display advertising. In Proceedings of the International Conference on Machine Learning. 802810.Google ScholarGoogle Scholar
  44. [44] You Jiaxuan, Wang Yichen, Pal Aditya, Eksombatchai Pong, Rosenburg Chuck, and Leskovec Jure. 2019. Hierarchical temporal convolutional networks for dynamic recommender systems. In Proceedings of the World Wide Web Conference. ACM, 22362246.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Yu Feng, Liu Qiang, Wu Shu, Wang Liang, and Tan Tieniu. 2016. A dynamic recurrent model for next basket recommendation. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 729732.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Yu Fisher, Wang Dequan, Shelhamer Evan, and Darrell Trevor. 2018. Deep layer aggregation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 24032412.Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Zhang Minjia, Rajbhandari Samyam, Wang Wenhan, and He Yuxiong. 2018. DeepCPU: Serving RNN-based deep learning models 10x faster. In Proceedings of the USENIX Annual Technical Conference (USENIXATC’18). 951965.Google ScholarGoogle Scholar
  48. [48] Zhang Wenjie, Liu Shunqi, Gandhi Oktoviano, Rodríguez-Gallegos Carlos D., Quan Hao, and Srinivasan Dipti. 2021. Deep-learning-based probabilistic estimation of solar PV soiling loss. IEEE Trans. Sustain. Energy 12, 4 (2021), 24362444.Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Zhang Wenjie, Quan Hao, and Srinivasan Dipti. 2018. An improved quantile regression neural network for probabilistic load forecasting. IEEE Trans. Smart Grid 10, 4 (2018).Google ScholarGoogle Scholar
  50. [50] Zhang Wenjie, Quan Hao, and Srinivasan Dipti. 2018. Parallel and reliable probabilistic load forecasting via quantile regression forest and quantile determination. Energy 160 (2018), 810819.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Zhou Guorui, Mou Na, Fan Ying, Pi Qi, Bian Weijie, Zhou Chang, Zhu Xiaoqiang, and Gai Kun. 2019. Deep interest evolution network for click-through rate prediction. In Proceedings of the AAAI Conference on Artificial Intelligence. 59415948.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Zhou Guorui, Zhu Xiaoqiang, Song Chenru, Fan Ying, Zhu Han, Ma Xiao, Yan Yanghui, Jin Junqi, Li Han, and Gai Kun. 2018. Deep interest network for click-through rate prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 10591068.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. CoRec: An Efficient Internet Behavior-based Recommendation Framework with Edge-cloud Collaboration on Deep Convolution Neural Networks

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM Transactions on Sensor Networks
            ACM Transactions on Sensor Networks  Volume 19, Issue 2
            May 2023
            599 pages
            ISSN:1550-4859
            EISSN:1550-4867
            DOI:10.1145/3575873
            Issue’s Table of Contents

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 20 December 2022
            • Online AM: 28 July 2022
            • Accepted: 11 March 2022
            • Revised: 21 January 2022
            • Received: 7 August 2021
            Published in tosn Volume 19, Issue 2

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          View Full Text

          HTML Format

          View this article in HTML Format .

          View HTML Format