Adaptive demand response: Online learning of restless and controlled bandits | IEEE Conference Publication | IEEE Xplore