Real-time sequentially decision for optimal action using prediction of the state-action pair | IEEE Conference Publication | IEEE Xplore