Proximal Policy Optimization With Policy Feedback | IEEE Journals & Magazine | IEEE Xplore