Journals & Magazines >IEEE/ACM Transactions on Netw... >Volume: 29 Issue: 2

Learning to Schedule Network Resources Throughput and Delay Optimally Using Q⁺-Learning

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

As network architecture becomes complex and the user requirement gets diverse, the role of efficient network resource management becomes more important. However, existing...Show More

Metadata

Abstract:

As network architecture becomes complex and the user requirement gets diverse, the role of efficient network resource management becomes more important. However, existing throughput-optimal scheduling algorithms such as the max-weight algorithm suffer from poor delay performance. In this paper, we present reinforcement learning-based network scheduling algorithms for a single-hop downlink scenario which achieve throughput-optimality and converge to minimal delay. To this end, we first formulate the network optimization problem as a Markov decision process (MDP) problem. Then, we introduce a new state-action value function called

$Q^{+}$ -function and develop a reinforcement learning algorithm called

$Q^{+}$ -learning with UCB (Upper Confidence Bound) exploration which guarantees small performance loss during a learning process. We also derive an upper bound of the sample complexity in our algorithm, which is more efficient than the best known bound from Q-learning with UCB exploration by a factor of

$\gamma ^{2}$ where

$\gamma$ is the discount factor of the MDP problem. Finally, via simulation, we verify that our algorithm shows a delay reduction of up to 40.8% compared to the max-weight algorithm over various scenarios. We also show that the Q⁺-learning with UCB exploration converges to an

$\epsilon$ -optimal policy 10 times faster than Q-learning with UCB.

Published in: IEEE/ACM Transactions on Networking ( Volume: 29, Issue: 2, April 2021)

Page(s): 750 - 763

Date of Publication: 26 January 2021

ISSN Information:

DOI: 10.1109/TNET.2021.3051663

Funding Agency:

Contents

References is not available for this document.

Learning to Schedule Network Resources Throughput and Delay Optimally Using Q⁺-Learning

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Learning to Schedule Network Resources Throughput and Delay Optimally Using Q+-Learning

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Authors

Figures

References

Citations

Keywords

Metrics

Supplemental Items

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Learning to Schedule Network Resources Throughput and Delay Optimally Using Q⁺-Learning