Processing math: 100%
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning | IEEE Journals & Magazine | IEEE Xplore

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning


Abstract:

Multi-user delay-constrained scheduling is a crucial challenge in various real-world applications, such as wireless communication, live streaming, and cloud computing. Th...Show More

Abstract:

Multi-user delay-constrained scheduling is a crucial challenge in various real-world applications, such as wireless communication, live streaming, and cloud computing. The scheduler must make real-time decisions to guarantee both delay and resource constraints simultaneously, without prior information on system dynamics that can be time-varying and challenging to estimate. Additionally, many practical scenarios suffer from partial observability issues due to sensing noise or hidden correlation. To address these challenges, we propose a deep reinforcement learning (DRL) algorithm called Recurrent Softmax Delayed Deep Double Deterministic Policy Gradient ( \mathtt {RSD4} ) (https://github.com/hupihe/RSD4), which is a data-driven method based on a Partially Observed Markov Decision Process (POMDP) formulation. \mathtt {RSD4} guarantees resource and delay constraints by Lagrangian dual and delay-sensitive queues, respectively. It also efficiently handles partial observability with a memory mechanism enabled by the recurrent neural network (RNN). Moreover, it introduces user-level decomposition and node-level merging to support large-scale multihop scenarios. Extensive experiments on simulated and real-world datasets demonstrate that \mathtt {RSD4} is robust to system dynamics and partially observable environments and achieves superior performance over existing methods.
Published in: IEEE/ACM Transactions on Networking ( Volume: 32, Issue: 3, June 2024)
Page(s): 2344 - 2359
Date of Publication: 09 February 2024

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.