Loading [a11y]/accessibility-menu.js
Elastic Task Offloading and Resource Allocation Over Hybrid Cloud: A Reinforcement Learning Approach | IEEE Journals & Magazine | IEEE Xplore

Elastic Task Offloading and Resource Allocation Over Hybrid Cloud: A Reinforcement Learning Approach


Abstract:

Hybrid cloud is an emerging computing cloud solution that leverages the power of the public cloud, without abandoning the computation resources of existing on-premises da...Show More

Abstract:

Hybrid cloud is an emerging computing cloud solution that leverages the power of the public cloud, without abandoning the computation resources of existing on-premises data-centers. Further, the wide adoption of cloud-native technology, like containers, brings the capability of rapid horizontal and vertical scaling to task workloads. However, the heterogeneity and flexibility can bring more complexity to task processing performance optimization, especially with constrained on-premises energy consumption and public cloud renting cost quota. In this paper, we seek to optimize the task processing performance under long-term on-premises energy consumption and public cloud renting cost constraints via dynamic task offloading and elastic scaling. We formulate the problem as a two-stage mixed integer non-linear programming (MINLP) problem, and propose an online approach named ETHC (elastic task offloading and resource allocation handler over hybrid cloud). For the first stage, we introduce a Lyapunov optimization-assisted Deep Reinforcement Learning (DRL) agent to decompose the long-term optimization problem into per-time-segment sub-problems on making task offloading decisions. In the second stage, based on the M/M/k queuing model, we prove the container instance number configuration and per-instance resource allocation problem as a convex MINLP problem. An efficient bi-section-based algorithm is introduced to obtain the optimal configurations. Extensive simulations show that ETHC manages to stabilize the task processing queue and satisfy the long-term constraints under various environments and parameters setup, with slight overhead on the convergence speed. Besides, optimal resource configuration and instance number can be obtained at each time-segment with low time complexity.
Published in: IEEE Transactions on Network and Service Management ( Volume: 21, Issue: 2, April 2024)
Page(s): 1983 - 1997
Date of Publication: 29 December 2023

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.