Discounted UCB1-tuned for Q-learning | IEEE Conference Publication | IEEE Xplore