Partially observable Markov decision processes with reward information | IEEE Conference Publication | IEEE Xplore