Time Optimal Data Harvesting in Two Dimensions through Reinforcement Learning Without Engineered Reward Functions | IEEE Conference Publication | IEEE Xplore