Hardware-Friendly Actor-Critic Reinforcement Learning Through Modulation of Spike-Timing-Dependent Plasticity | IEEE Journals & Magazine | IEEE Xplore