Loading [a11y]/accessibility-menu.js
Dynamic Spectrum Anti-Jamming With Reinforcement Learning Based on Value Function Approximation | IEEE Journals & Magazine | IEEE Xplore

Dynamic Spectrum Anti-Jamming With Reinforcement Learning Based on Value Function Approximation


Abstract:

This letter addresses the spectrum anti-jamming problem with multiple Internet of Things (IoT) devices for uplink transmissions, where policies for configuring frequency-...Show More

Abstract:

This letter addresses the spectrum anti-jamming problem with multiple Internet of Things (IoT) devices for uplink transmissions, where policies for configuring frequency-domain channels have to be learned without the knowledge of the time-frequency distribution of the interference. The problem of decision-making or learning is expected to be solved by reinforcement learning (RL) approaches. However, the state-of-the-art RL-based spectrum anti-jamming methods may not be applicable in IoT systems, suffer from high computational complexity or may converge to a policy that may not be the best for each user. Therefore, we propose a novel spectrum anti-jamming scheme where configuration policies for the IoT devices are sequentially optimized with value function approximation-based multi-agent RL. Simulation results show that our proposed algorithm outperforms various baselines in terms of average normalized throughput.
Published in: IEEE Wireless Communications Letters ( Volume: 12, Issue: 2, February 2023)
Page(s): 386 - 390
Date of Publication: 09 December 2022

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.