Learning an Optimal Control Policy for a Markov Decision Process Under Linear Temporal Logic Specifications | IEEE Conference Publication | IEEE Xplore