Policy Evaluation in Continuous MDPs With Efficient Kernelized Gradient Temporal Difference | IEEE Journals & Magazine | IEEE Xplore