Skip to main content

Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Wireless Systems

  • Conference paper
  • First Online:
Intelligent Computing

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 284))

Abstract

Scheduling plays a pivotal role in multi-user wireless communications, since the quality of service of various users largely depends upon the allocated radio resources. In this paper, we propose a novel scheduling algorithm with contiguous frequency-domain resource allocation (FDRA) based on deep reinforcement learning (DRL) that jointly selects users and allocates resource blocks (RBs). The scheduling problem is modeled as a Markov decision process, and a DRL agent determines which user and how many consecutive RBs for that user should be scheduled at each RB allocation step. The state space, action space, and reward function are delicately designed to train the DRL network. More specifically, the original quasi-continuous action space, which is inherent to contiguous FDRA, is refined into a finite and discrete action space to obtain a trade-off between the inference latency and system performance. Simulation results show that the proposed DRL-based scheduling algorithm outperforms other representative baseline schemes while having lower online computational complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. 3GPP TR 38.824, V16.0.0. Study on physical layer enhancements for NR ultra-reliable and low latency case, March 2019

    Google Scholar 

  2. 3GPP TR 38.901, V16.1.0. Study on channel model for frequencies from 0.5 to 100 GHz, December 2019

    Google Scholar 

  3. 3GPP TS 38.211, V16.3.0. NR; Physical channels and modulation, September 2020

    Google Scholar 

  4. 3GPP TS 38.213, V16.3.0. NR; Physical layer procedures for control, September 2020

    Google Scholar 

  5. 3GPP TS 38.214, V16.3.0. NR; Physical layer procedures for data, September 2020

    Google Scholar 

  6. 3GPP TS 38.331, V16.1.0. NR; Radio resource control protocol specification, July 2020

    Google Scholar 

  7. He, X., Wang, K., Huang, H., Miyazaki, T., Wang, Y., Guo, S.: Green resource allocation based on deep reinforcement learning in content-centric IoT. IEEE Trans. Emerg. Top. Comput. 8(3), 781–796 (2020)

    Article  Google Scholar 

  8. Li, X., Alkhateeb, A.: Deep learning for direct hybrid precoding in millimeter wave massive MIMO systems. In: 2019 53rd Asilomar Conference on Signals, Systems, and Computers, pp. 800–805 (2019)

    Google Scholar 

  9. Li, X., Alkhateeb, A., Tepedelenlioğlu, C.: Generative adversarial estimation of channel covariance in vehicular millimeter wave systems. In: 2018 52nd Asilomar Conference on Signals, Systems, and Computers, pp. 1572–1576 (2018)

    Google Scholar 

  10. Luong, N.C., et al.: Applications of deep reinforcement learning in communications and networking: a survey. IEEE Commun. Surv. Tutor. 21(4), 3133–3174 (2019)

    Article  Google Scholar 

  11. Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York (2014)

    MATH  Google Scholar 

  12. Sun, S., Moon, S.: Practical scheduling algorithms with contiguous resource allocation for next-generation wireless systems. IEEE Wirel. Commun. Lett. 10(4), 725–729 (2021)

    Google Scholar 

  13. Sun, S., Moon, S., Fwu, J.: Practical link adaptation algorithm with power density offsets for 5G uplink channels. IEEE Wirel. Commun. Lett. 9(6), 851–855 (2020)

    Article  Google Scholar 

  14. Tse, D.: Forward link multiuser diversity through proportional fair scheduling. Presentation at Bell Labs, August 1999

    Google Scholar 

  15. Tsiropoulou, E.E., Kapoukakis, A., Papavassiliou, S.: Uplink resource allocation in SC-FDMA wireless networks: a survey and taxonomy. Comput. Netw. 96, 1–28 (2016)

    Article  Google Scholar 

  16. Wong, I.C., Oteri, O.F., McCoy, J.W.: Resource allocation in multi data stream communication link. U.S. Patent 7 911 934, March 2011

    Google Scholar 

  17. Xu, Z., Wang, Y., Tang, J., Wang, J., Gursoy, M.C.: A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs. In: 2017 IEEE International Conference on Communications (ICC), pp. 1–6 (2017)

    Google Scholar 

  18. Yan, H., Ashikhmin, A., Yang, H.: Optimally supporting IoT with cell-free massive MIMO. In: 2020 IEEE Global Communications Conference (GLOBECOM), pp. 1–6 (2020)

    Google Scholar 

  19. Yan, H., Ashikhmin, A., Yang, H.: A scalable and energy efficient IoT system supported by cell-free massive MIMO. IEEE Internet Things J. (2021)

    Google Scholar 

  20. Yan, H., Lu, I.T.: BS-UE association and power allocation in heterogeneous massive MIMO systems. IEEE Access 8, 184045–184060 (2020)

    Article  Google Scholar 

  21. Ye, H., Li, G.Y., Juang, B.F.: Deep reinforcement learning based resource allocation for V2V communications. IEEE Trans. Veh. Technol. 68(4), 3163–3173 (2019)

    Article  Google Scholar 

  22. Zhang, C., Patras, P., Haddadi, H.: Deep learning in mobile and wireless networking: a survey. IEEE Commun. Surv. Tutor. 21(3), 2224–2287 (2019)

    Article  Google Scholar 

  23. Zhao, N., Liang, Y., Niyato, D., Pei, Y., Wu, M., Jiang, Y.: Deep reinforcement learning for user association and resource allocation in heterogeneous cellular networks. IEEE Trans. Wirel. Commun. 18(11), 5141–5152 (2019)

    Article  Google Scholar 

Download references

Acknowledgment

The authors would like to thank the Next Generation and Standards Group (NGS) in Intel Corporation for their great support of this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shu Sun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sun, S., Li, X. (2021). Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Wireless Systems. In: Arai, K. (eds) Intelligent Computing. Lecture Notes in Networks and Systems, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-030-80126-7_46

Download citation

Publish with us

Policies and ethics