Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data | IEEE Journals & Magazine | IEEE Xplore