A Class of Optimal Control Problem for Stochastic Discrete-Time Systems with Average Reward Reinforcement Learning | IEEE Conference Publication | IEEE Xplore