Kernel-Based Direct Policy Search Reinforcement Learning Based on Variational Bayesian Inference | IEEE Conference Publication | IEEE Xplore