Loading [MathJax]/extensions/MathZoom.js
The Multi-Armed Bandit With Stochastic Plays | IEEE Journals & Magazine | IEEE Xplore

The Multi-Armed Bandit With Stochastic Plays


Abstract:

We extend the stochastic multi-armed bandit to the case where the number of arms to play evolves as a stationary process. Our work is motivated by demand response in powe...Show More

Abstract:

We extend the stochastic multi-armed bandit to the case where the number of arms to play evolves as a stationary process. Our work is motivated by demand response in power systems, in which the number of arms to play, or loads to dispatch, depends on a random power imbalance. We give an upper confidence bound-based algorithm that achieves sublinear pseudo-regret. We apply our results in several examples from demand response.
Published in: IEEE Transactions on Automatic Control ( Volume: 63, Issue: 7, July 2018)
Page(s): 2280 - 2286
Date of Publication: 23 October 2017

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.