Duplicated Replay Buffer for Asynchronous Deep Deterministic Policy Gradient | IEEE Conference Publication | IEEE Xplore