Temporal Cross-Selling Optimization Using Action Proxy-Driven Reinforcement Learning | IEEE Conference Publication | IEEE Xplore