Self learning control of constrained Markov chains - a gradient approach | IEEE Conference Publication | IEEE Xplore