Reinforcement learning via kernel temporal difference | IEEE Conference Publication | IEEE Xplore