Reinforcement learning with nonstationary reward depending on the episode | IEEE Conference Publication | IEEE Xplore