research-article

D2D communication resource allocation algorithm based on multi-agent reinforcement learning

Authors:

Zhijun QiAuthors Info & Claims

ICCNS '23: Proceedings of the 2023 13th International Conference on Communication and Network Security

Pages 276 - 281

https://doi.org/10.1145/3638782.3638825

Published: 18 April 2024 Publication History

Abstract

To solve the interference problem of device-to-device (D2D) communication in cellular network, a distributed resource allocation algorithm based on simultaneous wireless information and power transfer technology and dual deep Q-network is proposed to achieve distributed resource allocation and maximize energy efficiency of D2D links. Firstly, the resource allocation problem of D2D communication is formulated as a Markov decision process. Secondly, the allocation problem is decomposed into two sub problems: power control and channel allocation. Then, the reinforcement learning technique is introduced to model the optimization problem as a multi-agent learning optimal strategy problem. Finally, by continuously iterative updating and learning better action strategies, the optimization goals and reasonable allocation of resources are achieved. Experimental results show that the proposed algorithm can effectively improve the energy efficiency of D2D link layer and the throughput of D2D link, and has certain feasibility and effectiveness.

References

[1]

Gupta Sucheta, Patel Rajan, Gupta Rajesh, Tanwar Sudeep, Patel Nimisha. 2022. A Survey on Resource Allocation Schemes in Device-to-Device Communication.In 12th International Conference on Cloud Computing, Data Science and Engineering, IEEE,Virtual, Online, India, 140-145. https://doi.org/10.1109/Confluence52989.2022.9734183

[2]

Khan Md.Tabrej, Adholiya Ashish. 2022. Device to Device Communication over 5G. In 3rd International Conference on Computing Science, Communication and Security, Springer, Virtual, Online, 255-273. https://doi.org/10.1007/978-3-031-10551-7_19

[3]

Mohamed Khalid S., Alias Mohamad Y., Roslee Mardeni, Raji Yusuf M. 2021.Towards green communication in 5G systems: Survey on beamforming concept. IET Communications, 15, 1(January 2021),142-154. https://doi.org/10.1049/cmu2.12066

Digital Library

[4]

Hao Yuanyuan, Ni Qiang, Li Hai, Hou Shujuan. 2018. Robust Multi-Objective Optimization for EE-SE Tradeoff in D2D Communications Underlaying Heterogeneous Networks. IEEE T. Commun., 66, 10(October 2018), 4936-4949. https://doi.org/10.1109/ TCOMM.2018.2834920

[5]

Pei Lu,Yang Zhaohui, Pan Cunhua, Huang Wenhuan, Chen Ming, Elkashlan Maged, Nallanathan Arumugam. 2018. Energy-efficient D2D communications underlaying NOMA-based networks with energy harvesting. IEEE Commun. Lett., 22,5(May 2018), 914-917. https://doi.org/10.1109/LCOMM.2018.2811782

[6]

Huang Jun, Cui Jingjing, Xing Cong-Cong, Gharavi Hamid. 2022. Energy-Efficient SWIPT-Empowered D2D Mode Selection. IEEE T. Veh. Technol., 69,4 (April 2020), 3903-3915. https://doi.org/10.1109/TVT.2020.2970235

[7]

Xu Yongjun, Gu Bowen, Li Dong, Yang Zhaohui, Huang Chongwen, Wong Kai-Kit. 2022. Resource allocation for secure SWIPT-enabled D2D communications with α fairness. IEEE T. Veh. Technol., 71, 1(January 2022), 1101-1106. https://doi.org/10.1109/TVT. 2021.3129787

[8]

Al-Wesabi Fahd N., Khan Imran, Mohammed Saleem Latteef, Jameel Huda Farooq, Alamgeer Mohammad, Al-Sharafi Ali M., Kim Byung Seo.2022. Optimal resource allocation method for device-to-device communication in 5G networks. Computers, Materials and Continua, 71, 1(January 2022), 1-15. https://doi.org/10.32604/cmc.2022.018469

[9]

Sreedevi A.G., Rama Rao T. 2019. Reinforcement learning algorithm for 5G indoor device-to-device communications. T. Emerg Telecommun. T., 30,9(September 2019), e3670. https://doi.org/10.1002/ett.3670

Digital Library

[10]

Yuan Yazhou, Li Zhijie, Yang Yi, Guan Xinping. 2022. Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication. IEEE T. Veh. Technol., 71, 1(January 2022), 984-999. https://doi.org/10.1109/TVT.2021.3130159

[11]

Omidkar Atefeh, Khalili Ata, Nguyen Ha H., Shafiei Hossein. 2022. Reinforcement-Learning-Based Resource Allocation for Energy-Harvesting-Aided D2D Communications in IoT Networks. IEEE Internet Things, 9,17(September 2022),16521-16531. https://doi.org/10.1109/JIOT.2022. 3151001

[12]

Deng Bingguang, Xu Chengyi, Zhang Tai, Sun Yuanxin, Zhang Lin, Pei Errong.2023. A Joint Resource Allocation Method of D2D Communication Resources Based on Multi Agent Deep Reinforcement Learning. Journal of Electronics and Information Technology, 45, 4(April 2023), 1173-1182. https://doi.org/ 10.11999/JEIT220231 (In Chinese)

[13]

John R. Smith and Shih-Fu Chang. 1997. Visual Seek: a fully automated content-based image query system. In Proceedings of the fourth ACM international conference on Multimedia (MULTIMEDIA ’96). Association for Computing Machinery, New York, NY, USA, 87–98. https://doi.org/10.1145/244130.244151

Digital Library

[14]

Mustapha Aatila,Mohamed Lachgar, Ali Kartit.2020. An Overview of Gradient Descent Algorithm Optimization in Machine Learning: Application in the Ophthalmology Field. In 3rd International Conference on Smart Applications and Data Analysis for Smart Cyber-Physical Systems, Springer, Marrakesh, Morocco, 349-359, https://doi.org/10.1007/978-3-030-45183-7_27

[15]

Ye Hao, Li Geoffrey Ye, Juang Biing-Hwang Fred. 2019. Deep reinforcement learning based resource allocation for V2V communications. IEEE T. Veh. Technol., 68, 4(April 2019), 3163-3173. https://doi.org/10.1109/TVT.2019.2897134

Recommendations

Resource allocation and dynamic power control for D2D communication underlaying uplink multi-cell networks

Underlaying device-to-device (D2D) communication is suggested as a promising technology for the next generation cellular networks (5G), where users in close proximity can transmit directly to one another bypassing the base station. However, when D2D ...
Efficient resource allocation algorithm for overlay D2D communication

Device-to-device (D2D) communication in cellular systems offers an economically viable means of improving the system capacity by allocating multiple D2D links within the same resource block (RB), under appropriate interference-control techniques. In ...
D2D Resource Allocation Based on Reinforcement Learning and QoS
Abstract
Device-to-device (D2D) communications is designed to improve the overall network performance, including low latency, high data rates, and system capacity of the fifth-generation (5G) wireless networks. The system capacity can even be improved by ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICCNS '23: Proceedings of the 2023 13th International Conference on Communication and Network Security

December 2023

363 pages

ISBN:9798400707964

DOI:10.1145/3638782

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 April 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ICCNS 2023

ICCNS 2023: 2023 13th International Conference on Communication and Network Security

December 6 - 8, 2023

Fuzhou, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
18
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)2

Reflects downloads up to 14 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten