A Q-learning based Dynamic Power Control Algorithm for D2D Communication Underlaying Cellular Networks | IEEE Conference Publication