Learning Policies for Markov Decision Processes From Data | IEEE Journals & Magazine | IEEE Xplore