Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games | IEEE Conference Publication | IEEE Xplore