From minimax value to low-regret algorithms for online Markov decision processes | IEEE Conference Publication | IEEE Xplore