Model-free Q-learning optimal resource allocation in uncertain communication networks | IEEE Conference Publication