A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space | IEEE Conference Publication | IEEE Xplore