Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks | IEEE Conference Publication | IEEE Xplore