Higher order Q-Learning | IEEE Conference Publication | IEEE Xplore