Abstract
Dynamic channel assignment (DCA) plays a key role in extending vehicular ad-hoc network capacity and mitigating congestion. However, channel assignment in vehicular direct-communication scenarios faces challenges such as the mutual influence of large numbers of nodes, the lack of centralized coordination, and unknown global state information. To address this problem, a multiagent reinforcement learning (RL) based cooperative DCA (RL-CDCA) mechanism is proposed. Specifically, each vehicular node learns proper channel-selection and backoff-adaptation strategies from real-time channel state information (CSI) using two cooperative RL models. In addition, neural networks are constructed as nonlinear Q-function approximators, which facilitate mapping the continuously sensed input to a mixed policy output. Nodes are driven to locally share and incorporate their individual rewards so that they can optimize their policies in a distributed, collaborative manner. Simulation results show that the proposed multiagent RL-CDCA reduces the one-hop packet delay by no less than 73.73%, improves the average packet delivery ratio by no less than 12.66% in a highly dense scenario, and improves the fairness of global network resource allocation.
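The abstract describes the mechanism only at a high level. As an illustration of the kind of agent it outlines, the following is a minimal sketch, not the authors' implementation: a per-node Q-network mapping locally sensed CSI to channel-selection and backoff-adaptation actions, an epsilon-greedy policy, and a reward blended with neighbors' locally shared rewards. The feature layout (per-channel busy ratio and collision rate), the channel and backoff counts, the single two-headed network (the paper uses two cooperative RL models), the simplified TD target, and all function names are assumptions introduced here for illustration only.

```python
# Hypothetical sketch of a per-vehicle Q-learning agent (assumptions noted above).
import random
import torch
import torch.nn as nn

N_CHANNELS = 6            # number of candidate channels (assumed)
N_BACKOFF_LEVELS = 4      # discrete backoff-window adjustments (assumed)
CSI_DIM = 2 * N_CHANNELS  # per-channel busy ratio + collision rate (assumed features)

class QNet(nn.Module):
    """Nonlinear Q-function approximator with channel and backoff output heads."""
    def __init__(self):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(CSI_DIM, 64), nn.ReLU(),
                                   nn.Linear(64, 64), nn.ReLU())
        self.channel_head = nn.Linear(64, N_CHANNELS)
        self.backoff_head = nn.Linear(64, N_BACKOFF_LEVELS)

    def forward(self, csi):
        h = self.trunk(csi)
        return self.channel_head(h), self.backoff_head(h)

def select_action(net, csi, epsilon=0.1):
    """Epsilon-greedy selection of a (channel, backoff) pair from local CSI."""
    if random.random() < epsilon:
        return random.randrange(N_CHANNELS), random.randrange(N_BACKOFF_LEVELS)
    with torch.no_grad():
        q_ch, q_bo = net(csi)
    return int(q_ch.argmax()), int(q_bo.argmax())

def cooperative_reward(own_reward, neighbor_rewards, weight=0.5):
    """Blend the node's own reward with locally shared neighbor rewards."""
    if not neighbor_rewards:
        return own_reward
    return (1 - weight) * own_reward + weight * sum(neighbor_rewards) / len(neighbor_rewards)

def td_update(net, optimizer, csi, action, reward, next_csi, gamma=0.9):
    """One Q-learning step; the two heads are treated independently for brevity."""
    q_ch, q_bo = net(csi)
    with torch.no_grad():
        next_q_ch, next_q_bo = net(next_csi)
        target = reward + gamma * (next_q_ch.max() + next_q_bo.max())
    pred = q_ch[action[0]] + q_bo[action[1]]
    loss = (pred - target) ** 2
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return float(loss)

if __name__ == "__main__":
    net = QNet()
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    csi = torch.rand(CSI_DIM)  # stand-in for locally sensed CSI
    action = select_action(net, csi)
    r = cooperative_reward(own_reward=1.0, neighbor_rewards=[0.5, 0.8])
    td_update(net, opt, csi, action, r, next_csi=torch.rand(CSI_DIM))
```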
Author information
Contributions
Yun-peng WANG designed the research. Kun-xian ZHENG processed the data and drafted the manuscript. Da-xin TIAN and Xu-ting DUAN helped organize the manuscript. Kun-xian ZHENG and Jian-shan ZHOU revised and finalized the paper.
Ethics declarations
Yun-peng WANG, Kun-xian ZHENG, Da-xin TIAN, Xu-ting DUAN, and Jian-shan ZHOU declare that they have no conflict of interest.
Additional information
Project supported by the National Natural Science Foundation of China (Nos. 61672082 and 61822101), the Beijing Municipal Natural Science Foundation, China (No. 4181002), and the Beihang University Innovation and Practice Fund for Graduate, China (No. YCSJ-02-2018-05).
Cite this article
Wang, Yp., Zheng, Kx., Tian, Dx. et al. Cooperative channel assignment for VANETs based on multiagent reinforcement learning. Front Inform Technol Electron Eng 21, 1047–1058 (2020). https://doi.org/10.1631/FITEE.1900308