skip to main content
10.1145/3543507.3583436acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning

Published:30 April 2023Publication History

ABSTRACT

With the rapid development of edge computing in the post-COVID19 pandemic period, precise workload forecasting is considered the basis for making full use of the edge limited resources, and both edge service providers (ESPs) and edge service consumers (ESCs) can benefit significantly from it. Existing paradigms of workload forecasting (i.e., edge-only or cloud-only) are improper, due to failing to consider the inter-site correlations and might suffer from significant data transmission delays. With the increasing adoption of edge platforms by web services, it is critical to balance both accuracy and efficiency in workload forecasting. In this paper, we propose ELASTIC, which is the first study that leverages a cloud-edge collaborative paradigm for edge workload forecasting with multi-view graphs. Specifically, at the global stage, we design a learnable aggregation layer on each edge site to reduce the time consumption while capturing the inter-site correlation. Additionally, at the local stage, we design a disaggregation layer combining both the intra-site correlation and inter-site correlation to improve the prediction accuracy. Extensive experiments on realistic edge workload datasets collected from China’s largest edge service provider show that ELASTIC outperforms state-of-the-art methods, decreases time consumption, and reduces communication cost.

References

  1. Cristiano Aguzzi, Lorenzo Gigli, Luca Sciullo, Angelo Trotta, and Marco Di Felice. 2020. From cloud to edge: Seamless software migration at the era of the web of things. IEEE Access 8 (2020), 228118–228135.Google ScholarGoogle ScholarCross RefCross Ref
  2. George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung. 2015. Time series analysis: forecasting and control. John Wiley & Sons.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Eli Cortez, Anand Bonde, Alexandre Muzio, Mark Russinovich, Marcus Fontoura, and Ricardo Bianchini. 2017. Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms. In Proceedings of the Symposium on Operating Systems Principles. 153–167.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Yuanrui Dong, Peng Zhao, Hanqiao Yu, Cong Zhao, and Shusen Yang. 2020. CDC: classification driven compression for bandwidth efficient edge-cloud collaborative deep learning. arXiv preprint arXiv:2005.02177 (2020).Google ScholarGoogle Scholar
  5. Kan Guo, Yongli Hu, Zhen Qian, Hao Liu, Ke Zhang, Yanfeng Sun, Junbin Gao, and Baocai Yin. 2020. Optimized graph convolution recurrent neural network for traffic prediction. IEEE Transactions on Intelligent Transportation Systems 22, 2 (2020), 1138–1149.Google ScholarGoogle ScholarCross RefCross Ref
  6. Kan Guo, Yongli Hu, Zhen Sean Qian, Yanfeng Sun, Junbin Gao, and Baocai Yin. 2020. An optimized temporal-spatial gated graph convolution network for traffic forecasting. IEEE Intelligent Transportation Systems Magazine (2020).Google ScholarGoogle Scholar
  7. Kan Guo, Yongli Hu, Yanfeng Sun, Sean Qian, Junbin Gao, and Baocai Yin. 2021. Hierarchical Graph Convolution Network for Traffic Forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 151–159.Google ScholarGoogle ScholarCross RefCross Ref
  8. Shengnan Guo, Youfang Lin, Ning Feng, and Chao Song. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 922–929.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Ori Hadary, Luke Marshall, Ishai Menache, Abhisek Pan, Esaias E Greeff, David Dion, Star Dorminey, Shailesh Joshi, Yang Chen, Mark Russinovich, 2020. Protean: VM allocation service at scale. In USENIX Symposium on Operating Systems Design and Implementation. 845–861.Google ScholarGoogle Scholar
  10. Qiang He, Zeqian Dong, Feifei Chen, Shuiguang Deng, Weifa Liang, and Yun Yang. 2022. Pyramid: enabling hierarchical neural networks with edge computing. In Proceedings of the ACM Web Conference. 1860–1870.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. István Hegedűs, Gábor Danner, and Márk Jelasity. 2019. Gossip learning as a decentralized alternative to federated learning. In IFIP International Conference on Distributed Applications and Interoperable Systems. Springer, 74–90.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Chuang Hu, Wei Bao, Dan Wang, and Fengming Liu. 2019. Dynamic adaptive DNN surgery for inference acceleration on the edge. In Proceedings of the International Conference on Computer Communications. 1423–1431.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Changhee Joo and Ness B Shroff. 2017. A novel coupled queueing model to control traffic via QoS-aware collision pricing in cognitive radio networks. In Proceedings of the International Conference on Computer Communications. 1–9.Google ScholarGoogle ScholarCross RefCross Ref
  14. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  15. Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).Google ScholarGoogle Scholar
  16. Jakub Konečnỳ, H Brendan McMahan, Felix X Yu, Peter Richtárik, Ananda Theertha Suresh, and Dave Bacon. 2016. Federated learning: Strategies for improving communication efficiency. arXiv preprint arXiv:1610.05492 (2016).Google ScholarGoogle Scholar
  17. Jitendra Kumar and Ashutosh Kumar Singh. 2021. Performance assessment of time series forecasting models for cloud datacenter networks’ workload prediction. Wireless Personal Communications 116, 3 (2021), 1949–1969.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Jitendra Kumar, Ashutosh Kumar Singh, and Rajkumar Buyya. 2021. Self directed learning based workload forecasting model for cloud resource management. Information Sciences 543 (2021), 345–366.Google ScholarGoogle ScholarCross RefCross Ref
  19. En Li, Liekang Zeng, Zhi Zhou, and Xu Chen. 2019. Edge AI: On-demand accelerating deep neural network inference via edge computing. IEEE Transactions on Wireless Communications 19, 1 (2019), 447–457.Google ScholarGoogle ScholarCross RefCross Ref
  20. Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2017. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. arXiv preprint arXiv:1707.01926 (2017).Google ScholarGoogle Scholar
  21. Haozhe Lin, Yushun Fan, Jia Zhang, and Bing Bai. 2021. Rest: Reciprocal framework for spatiotemporal-coupled predictions. In Proceedings of the ACM Web Conference. 3136–3145.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yuhua Lin and Haiying Shen. 2016. CloudFog: Leveraging fog to extend cloud gaming for thin-client MMOG with high quality of service. IEEE Transactions on Parallel and Distributed Systems 28, 2 (2016), 431–445.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Boyun Liu, Jingjing Guo, Chunlin Li, and Youlong Luo. 2020. Workload forecasting based elastic resource management in edge cloud. Computers & Industrial Engineering 139 (2020), 106136.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Chuizheng Meng, Sirisha Rambhatla, and Yan Liu. 2021. Cross-node federated graph neural network for spatio-temporal data modeling. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 1202–1211.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Chanh Nguyen, Cristian Klein, and Erik Elmroth. 2019. Multivariate LSTM-based location-aware workload prediction for edge data centers. In IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. 341–350.Google ScholarGoogle ScholarCross RefCross Ref
  26. Amy Ousterhout, Joshua Fried, Jonathan Behrens, Adam Belay, and Hari Balakrishnan. 2019. Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads. In USENIX Symposium on Networked Systems Design and Implementation. 361–378.Google ScholarGoogle Scholar
  27. Xiuquan Qiao, Pei Ren, Schahram Dustdar, Ling Liu, Huadong Ma, and Junliang Chen. 2019. Web AR: A promising future for mobile augmented reality—State of the art, challenges, and insights. Proc. IEEE 107, 4 (2019), 651–666.Google ScholarGoogle ScholarCross RefCross Ref
  28. Hiroaki Sakoe and Seibi Chiba. 1978. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing 26, 1 (1978), 43–49.Google ScholarGoogle ScholarCross RefCross Ref
  29. Youngjoo Seo, Michaël Defferrard, Pierre Vandergheynst, and Xavier Bresson. 2018. Structured sequence modeling with graph convolutional recurrent networks. In International Conference on Neural Information Processing. Springer, 362–373.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Xingjian Shi, Zhourong Chen, Hao Wang, Dit-Yan Yeung, Wai-Kin Wong, and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Advances in Neural Information Processing Systems 28 (2015).Google ScholarGoogle Scholar
  31. Rachee Singh, Sharad Agarwal, Matt Calder, and Paramvir Bahl. 2021. Cost-effective cloud edge traffic engineering with Cascara. In USENIX Symposium on Networked Systems Design and Implementation. 201–216.Google ScholarGoogle Scholar
  32. Wei Sun and Xiaolong Xu. 2022. ALEDAR: An Attentions-based Encoder-Decoder and Autoregressive model for workload Forecasting of Cloud Data Center. In IEEE International Conference on Computer Supported Cooperative Work in Design. 59–64.Google ScholarGoogle ScholarCross RefCross Ref
  33. Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems 27 (2014), 3104–3112.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Shibo Wang, Shusen Yang, and Cong Zhao. 2020. Surveiledge: Real-time video query based on collaborative cloud-edge deep learning. In Proceedings of International Conference on Computer Communications. 2519–2528.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Xiaoyang Wang, Yao Ma, Yiqi Wang, Wei Jin, Xin Wang, Jiliang Tang, Caiyan Jia, and Jian Yu. 2020. Traffic flow prediction via spatial temporal graph neural network. In Proceedings of the ACM Web Conference. 1082–1092.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Peter R Winters. 1960. Forecasting sales by exponentially weighted moving averages. Management Science (1960).Google ScholarGoogle Scholar
  37. Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2020), 4–24.Google ScholarGoogle ScholarCross RefCross Ref
  38. Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121 (2019).Google ScholarGoogle Scholar
  39. Mengwei Xu, Zhe Fu, Xiao Ma, Li Zhang, Yanan Li, Feng Qian, Shangguang Wang, Ke Li, Jingyu Yang, and Xuanzhe Liu. 2021. From cloud to edge: a first look at public edge platforms. In Proceedings of the ACM Internet Measurement Conference. 37–53.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Fanghua Ye, Zhiwei Lin, Chuan Chen, Zibin Zheng, and Hong Huang. 2021. Outlier-resilient web service QoS prediction. In Proceedings of the ACM Web Conference. 3099–3110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Bing Yu, Haoteng Yin, and Zhanxing Zhu. 2017. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875 (2017).Google ScholarGoogle Scholar
  42. Fisher Yu and Vladlen Koltun. 2015. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 (2015).Google ScholarGoogle Scholar
  43. Letian Zhang, Lixing Chen, and Jie Xu. 2021. Autodidactic neurosurgeon: Collaborative deep inference for mobile edge intelligence via online learning. In Proceedings of the ACM Web Conference. 3111–3123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Zheng Zhang, Ming Zhang, Albert G Greenberg, Y Charlie Hu, Ratul Mahajan, and Blaine Christian. 2010. Optimizing Cost and Performance in Online Service Provider Networks.. In Proceedings of the USENIX Symposium on Networked Systems Design and Implementation. 33–48.Google ScholarGoogle Scholar
  45. Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. 2021. Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 11106–11115.Google ScholarGoogle ScholarCross RefCross Ref
  46. Yonghua Zhu, Weilin Zhang, Yihai Chen, and Honghao Gao. 2019. A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment. EURASIP Journal on Wireless Communications and Networking 2019, 1 (2019), 1–18.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WWW '23: Proceedings of the ACM Web Conference 2023
      April 2023
      4293 pages
      ISBN:9781450394161
      DOI:10.1145/3543507

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 30 April 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

      Upcoming Conference

      WWW '24
      The ACM Web Conference 2024
      May 13 - 17, 2024
      Singapore , Singapore

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format