Conferences >2012 International Symposium ...

Dynamic Spectrum Access in realistic environments using reinforcement learning

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We study the use of reinforcement learning to model Dynamic Spectrum Access in a realistic multi-channel environment. Three different approaches from the literature on th...Show More

Metadata

Abstract:

We study the use of reinforcement learning to model Dynamic Spectrum Access in a realistic multi-channel environment. Three different approaches from the literature on the multi-armed bandit problem are compared on a set of realistic channel access models - two are based on stochastic models of the channel occupancy, while a third assumes an adversarial model. The algorithms are experimentally tested on channels occupied by primary users that behave according to a simple fair scheduler and a semi-Markov model based on WLAN traffic measurements; models that generate more realistic channel occupancy patterns than allowed by fixed i.i.d. probability models. The experiments show that the UCB1 algorithm of Auer et. al. [1] outperforms the other algorithms, and we support these findings using some simple theoretical results.

Published in: 2012 International Symposium on Communications and Information Technologies (ISCIT)

Date of Conference: 02-05 October 2012

Date Added to IEEE Xplore: 13 December 2012

ISBN Information:

DOI: 10.1109/ISCIT.2012.6380943

Conference Location: Gold Coast, QLD, Australia