ε-optimal discretized linear reward-penalty learning automata | IEEE Journals & Magazine | IEEE Xplore