Multi-Agent Cooperative Alternating Q-Learning Caching in D2D-Enabled Cellular Networks | IEEE Conference Publication | IEEE Xplore