A data-based online reinforcement learning algorithm with high-efficient exploration | IEEE Conference Publication | IEEE Xplore