Output feedback reinforcement Q-learning control for the discrete-time linear quadratic regulator problem | IEEE Conference Publication | IEEE Xplore