research-article

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning

Authors:
Yanan Li

Beijing University of Posts and Telecommunications, China

Beijing University of Posts and Telecommunications, China

0000-0003-4490-3466
View Profile

,
Haitao Yuan

Nanyang Technological University, Singapore

Nanyang Technological University, Singapore

0000-0001-6721-065X
View Profile

,
Zhe Fu

Tsinghua University, China

Tsinghua University, China

0000-0002-4637-6332
View Profile

,
Xiao Ma

Beijing University of Posts and Telecommunications, China

Beijing University of Posts and Telecommunications, China

0000-0001-5742-8890
View Profile

,
Mengwei Xu

Beijing University of Posts and Telecommunications, China

Beijing University of Posts and Telecommunications, China

0000-0001-6271-6993
View Profile

,
Shangguang Wang

Beijing University of Posts and Telecommunications, China

Beijing University of Posts and Telecommunications, China

0000-0001-7245-1298
View Profile

Authors Info & Claims

WWW '23: Proceedings of the ACM Web Conference 2023April 2023Pages 3056–3066https://doi.org/10.1145/3543507.3583436

Published:30 April 2023Publication History

WWW '23: Proceedings of the ACM Web Conference 2023

Pages 3056–3066

ABSTRACT

With the rapid development of edge computing in the post-COVID19 pandemic period, precise workload forecasting is considered the basis for making full use of the edge limited resources, and both edge service providers (ESPs) and edge service consumers (ESCs) can benefit significantly from it. Existing paradigms of workload forecasting (i.e., edge-only or cloud-only) are improper, due to failing to consider the inter-site correlations and might suffer from significant data transmission delays. With the increasing adoption of edge platforms by web services, it is critical to balance both accuracy and efficiency in workload forecasting. In this paper, we propose ELASTIC, which is the first study that leverages a cloud-edge collaborative paradigm for edge workload forecasting with multi-view graphs. Specifically, at the global stage, we design a learnable aggregation layer on each edge site to reduce the time consumption while capturing the inter-site correlation. Additionally, at the local stage, we design a disaggregation layer combining both the intra-site correlation and inter-site correlation to improve the prediction accuracy. Extensive experiments on realistic edge workload datasets collected from China’s largest edge service provider show that ELASTIC outperforms state-of-the-art methods, decreases time consumption, and reduces communication cost.

References

Cristiano Aguzzi, Lorenzo Gigli, Luca Sciullo, Angelo Trotta, and Marco Di Felice. 2020. From cloud to edge: Seamless software migration at the era of the web of things. IEEE Access 8 (2020), 228118–228135.Google ScholarCross Ref
George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung. 2015. Time series analysis: forecasting and control. John Wiley & Sons.Google ScholarDigital Library
Eli Cortez, Anand Bonde, Alexandre Muzio, Mark Russinovich, Marcus Fontoura, and Ricardo Bianchini. 2017. Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms. In Proceedings of the Symposium on Operating Systems Principles. 153–167.Google ScholarDigital Library
Yuanrui Dong, Peng Zhao, Hanqiao Yu, Cong Zhao, and Shusen Yang. 2020. CDC: classification driven compression for bandwidth efficient edge-cloud collaborative deep learning. arXiv preprint arXiv:2005.02177 (2020).Google Scholar
Kan Guo, Yongli Hu, Zhen Qian, Hao Liu, Ke Zhang, Yanfeng Sun, Junbin Gao, and Baocai Yin. 2020. Optimized graph convolution recurrent neural network for traffic prediction. IEEE Transactions on Intelligent Transportation Systems 22, 2 (2020), 1138–1149.Google ScholarCross Ref
Kan Guo, Yongli Hu, Zhen Sean Qian, Yanfeng Sun, Junbin Gao, and Baocai Yin. 2020. An optimized temporal-spatial gated graph convolution network for traffic forecasting. IEEE Intelligent Transportation Systems Magazine (2020).Google Scholar
Kan Guo, Yongli Hu, Yanfeng Sun, Sean Qian, Junbin Gao, and Baocai Yin. 2021. Hierarchical Graph Convolution Network for Traffic Forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 151–159.Google ScholarCross Ref
Shengnan Guo, Youfang Lin, Ning Feng, and Chao Song. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 922–929.Google ScholarDigital Library
Ori Hadary, Luke Marshall, Ishai Menache, Abhisek Pan, Esaias E Greeff, David Dion, Star Dorminey, Shailesh Joshi, Yang Chen, Mark Russinovich, 2020. Protean: VM allocation service at scale. In USENIX Symposium on Operating Systems Design and Implementation. 845–861.Google Scholar
Qiang He, Zeqian Dong, Feifei Chen, Shuiguang Deng, Weifa Liang, and Yun Yang. 2022. Pyramid: enabling hierarchical neural networks with edge computing. In Proceedings of the ACM Web Conference. 1860–1870.Google ScholarDigital Library
István Hegedűs, Gábor Danner, and Márk Jelasity. 2019. Gossip learning as a decentralized alternative to federated learning. In IFIP International Conference on Distributed Applications and Interoperable Systems. Springer, 74–90.Google ScholarDigital Library
Chuang Hu, Wei Bao, Dan Wang, and Fengming Liu. 2019. Dynamic adaptive DNN surgery for inference acceleration on the edge. In Proceedings of the International Conference on Computer Communications. 1423–1431.Google ScholarDigital Library
Changhee Joo and Ness B Shroff. 2017. A novel coupled queueing model to control traffic via QoS-aware collision pricing in cognitive radio networks. In Proceedings of the International Conference on Computer Communications. 1–9.Google ScholarCross Ref
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).Google Scholar
Jakub Konečnỳ, H Brendan McMahan, Felix X Yu, Peter Richtárik, Ananda Theertha Suresh, and Dave Bacon. 2016. Federated learning: Strategies for improving communication efficiency. arXiv preprint arXiv:1610.05492 (2016).Google Scholar
Jitendra Kumar and Ashutosh Kumar Singh. 2021. Performance assessment of time series forecasting models for cloud datacenter networks’ workload prediction. Wireless Personal Communications 116, 3 (2021), 1949–1969.Google ScholarDigital Library
Jitendra Kumar, Ashutosh Kumar Singh, and Rajkumar Buyya. 2021. Self directed learning based workload forecasting model for cloud resource management. Information Sciences 543 (2021), 345–366.Google ScholarCross Ref
En Li, Liekang Zeng, Zhi Zhou, and Xu Chen. 2019. Edge AI: On-demand accelerating deep neural network inference via edge computing. IEEE Transactions on Wireless Communications 19, 1 (2019), 447–457.Google ScholarCross Ref
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2017. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. arXiv preprint arXiv:1707.01926 (2017).Google Scholar
Haozhe Lin, Yushun Fan, Jia Zhang, and Bing Bai. 2021. Rest: Reciprocal framework for spatiotemporal-coupled predictions. In Proceedings of the ACM Web Conference. 3136–3145.Google ScholarDigital Library
Yuhua Lin and Haiying Shen. 2016. CloudFog: Leveraging fog to extend cloud gaming for thin-client MMOG with high quality of service. IEEE Transactions on Parallel and Distributed Systems 28, 2 (2016), 431–445.Google ScholarDigital Library
Boyun Liu, Jingjing Guo, Chunlin Li, and Youlong Luo. 2020. Workload forecasting based elastic resource management in edge cloud. Computers & Industrial Engineering 139 (2020), 106136.Google ScholarDigital Library
Chuizheng Meng, Sirisha Rambhatla, and Yan Liu. 2021. Cross-node federated graph neural network for spatio-temporal data modeling. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 1202–1211.Google ScholarDigital Library
Chanh Nguyen, Cristian Klein, and Erik Elmroth. 2019. Multivariate LSTM-based location-aware workload prediction for edge data centers. In IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. 341–350.Google ScholarCross Ref
Amy Ousterhout, Joshua Fried, Jonathan Behrens, Adam Belay, and Hari Balakrishnan. 2019. Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads. In USENIX Symposium on Networked Systems Design and Implementation. 361–378.Google Scholar
Xiuquan Qiao, Pei Ren, Schahram Dustdar, Ling Liu, Huadong Ma, and Junliang Chen. 2019. Web AR: A promising future for mobile augmented reality—State of the art, challenges, and insights. Proc. IEEE 107, 4 (2019), 651–666.Google ScholarCross Ref
Hiroaki Sakoe and Seibi Chiba. 1978. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing 26, 1 (1978), 43–49.Google ScholarCross Ref
Youngjoo Seo, Michaël Defferrard, Pierre Vandergheynst, and Xavier Bresson. 2018. Structured sequence modeling with graph convolutional recurrent networks. In International Conference on Neural Information Processing. Springer, 362–373.Google ScholarDigital Library
Xingjian Shi, Zhourong Chen, Hao Wang, Dit-Yan Yeung, Wai-Kin Wong, and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Advances in Neural Information Processing Systems 28 (2015).Google Scholar
Rachee Singh, Sharad Agarwal, Matt Calder, and Paramvir Bahl. 2021. Cost-effective cloud edge traffic engineering with Cascara. In USENIX Symposium on Networked Systems Design and Implementation. 201–216.Google Scholar
Wei Sun and Xiaolong Xu. 2022. ALEDAR: An Attentions-based Encoder-Decoder and Autoregressive model for workload Forecasting of Cloud Data Center. In IEEE International Conference on Computer Supported Cooperative Work in Design. 59–64.Google ScholarCross Ref
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems 27 (2014), 3104–3112.Google ScholarDigital Library
Shibo Wang, Shusen Yang, and Cong Zhao. 2020. Surveiledge: Real-time video query based on collaborative cloud-edge deep learning. In Proceedings of International Conference on Computer Communications. 2519–2528.Google ScholarDigital Library
Xiaoyang Wang, Yao Ma, Yiqi Wang, Wei Jin, Xin Wang, Jiliang Tang, Caiyan Jia, and Jian Yu. 2020. Traffic flow prediction via spatial temporal graph neural network. In Proceedings of the ACM Web Conference. 1082–1092.Google ScholarDigital Library
Peter R Winters. 1960. Forecasting sales by exponentially weighted moving averages. Management Science (1960).Google Scholar
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2020), 4–24.Google ScholarCross Ref
Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121 (2019).Google Scholar
Mengwei Xu, Zhe Fu, Xiao Ma, Li Zhang, Yanan Li, Feng Qian, Shangguang Wang, Ke Li, Jingyu Yang, and Xuanzhe Liu. 2021. From cloud to edge: a first look at public edge platforms. In Proceedings of the ACM Internet Measurement Conference. 37–53.Google ScholarDigital Library
Fanghua Ye, Zhiwei Lin, Chuan Chen, Zibin Zheng, and Hong Huang. 2021. Outlier-resilient web service QoS prediction. In Proceedings of the ACM Web Conference. 3099–3110.Google ScholarDigital Library
Bing Yu, Haoteng Yin, and Zhanxing Zhu. 2017. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875 (2017).Google Scholar
Fisher Yu and Vladlen Koltun. 2015. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 (2015).Google Scholar
Letian Zhang, Lixing Chen, and Jie Xu. 2021. Autodidactic neurosurgeon: Collaborative deep inference for mobile edge intelligence via online learning. In Proceedings of the ACM Web Conference. 3111–3123.Google ScholarDigital Library
Zheng Zhang, Ming Zhang, Albert G Greenberg, Y Charlie Hu, Ratul Mahajan, and Blaine Christian. 2010. Optimizing Cost and Performance in Online Service Provider Networks.. In Proceedings of the USENIX Symposium on Networked Systems Design and Implementation. 33–48.Google Scholar
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. 2021. Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 11106–11115.Google ScholarCross Ref
Yonghua Zhu, Weilin Zhang, Yihai Chen, and Honghao Gao. 2019. A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment. EURASIP Journal on Wireless Communications and Networking 2019, 1 (2019), 1–18.Google ScholarDigital Library

Index Terms

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning
1. Computer systems organization
  1. Architectures
    1. Distributed architectures

Recommendations

Workload forecasting based elastic resource management in edge cloud
Highlights
- An elastic resource management method based on workload forecasting is studied.
Abstract
Cloud services are provided at the edge of the network so that data from users can be processed and calculated at the edges. The user's irregular access triggers the fluctuations of the edge cloud workload. Therefore, an elastic ...
Read More
Strategy-proof mechanism for online resource allocation in cloud and edge collaboration
Abstract
Cloud computing is characterized by strong computing and storage capabilities, and edge computing has the advantages of low latency and low power consumption. Many service providers have begun to combine the advantages of cloud and edge computing ...
Read More
Deviceless edge computing: extending serverless computing to the edge of the network
SYSTOR '17: Proceedings of the 10th ACM International Systems and Storage Conference

The serverless paradigm has been rapidly adopted by developers of cloud-native applications, mainly because it relieves them from the burden of provisioning, scaling and operating the underlying infrastructure. In this paper, we propose a novel ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '23: Proceedings of the ACM Web Conference 2023
April 2023
4293 pages
ISBN:9781450394161
DOI:10.1145/3543507
Editors:
Ying Ding,
Jie Tang,
Juan Sequeda,
Lora Aroyo,
Carlos Castillo,
Geert-Jan Houben
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 30 April 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Workload forecasting
cloud-edge collaboration
edge computing
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate1,899of8,196submissions,23%
Upcoming Conference
WWW '24

Sponsor:

sigweb

The ACM Web Conference 2024

May 13 - 17, 2024

Singapore , Singapore
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 271
  Total Downloads
- Downloads (Last 12 months)271
- Downloads (Last 6 weeks)31
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning

WWW '23: Proceedings of the ACM Web Conference 2023

ABSTRACT

References

Cited By

Index Terms

Recommendations

Workload forecasting based elastic resource management in edge cloud

Strategy-proof mechanism for online resource allocation in cloud and edge collaboration

Deviceless edge computing: extending serverless computing to the edge of the network

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning

WWW '23: Proceedings of the ACM Web Conference 2023

ABSTRACT

References

Cited By

Index Terms

Recommendations

Workload forecasting based elastic resource management in edge cloud

Strategy-proof mechanism for online resource allocation in cloud and edge collaboration

Deviceless edge computing: extending serverless computing to the edge of the network

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media