Reinforcement-Learning-Based Resource Allocation for Energy-Harvesting-Aided D2D Communications in IoT Networks | IEEE Journals & Magazine | IEEE Xplore