ABSTRACT
Traditional machine learning methods used in marketing and digital commerce applications, including propensity scoring and many recommendation algorithms, are usually focused on improving short-term outcomes such as click-through rates. In many environments, however, long-term customer engagement can be more important than immediate outcomes. In this paper, we describe several real-world case studies on building personalization engines that address this problem using reinforcement learning (RL) methods. We also discuss the design patterns used to create such solutions.
- Jason Gauci, Edoardo Conti, Yitao Liang, Kittipat Virochsiri, Yuchen He, Zachary Kaden, Vivek Narayanan, Xiaohui Ye, Zhengxing Chen, and Scott Fujimoto. 2018. Horizon: Facebook's open source applied reinforcement learning platform. arXiv preprint arXiv:1811.00260 (2018).Google Scholar
- Lihong Li, Wei Chu, John Langford, and Robert E Schapire. 2010. A contextualbandit approach to personalized news article recommendation. In Proceedings of the 19th international conference on World wide web. 661--670.Google ScholarDigital Library
- Georgios Theocharous, Philip S Thomas, and Mohammad Ghavamzadeh. 2015. Personalized ad recommendation systems for life-time value optimization with guarantees. In Twenty-Fourth International Joint Conference on Artificial Intelligence.Google Scholar
Index Terms
- Building Next Best Action Engines for B2C and B2B Use Cases
Recommendations
A review of reinforcement learning for autonomous building energy management
AbstractThe area of building energy management has received a significant amount of interest in recent years. This area is concerned with combining advancements in sensor technologies, communications and advanced control algorithms to optimize ...
Efficient Reinforcement Learning Using State-Action Uncertainty with Multiple Heads
Artificial Neural Networks and Machine Learning – ICANN 2023AbstractIn reinforcement learning, an agent learns optimal actions for achieving a task by maximizing rewards in an environment. During learning, the agent decides its action for exploration or exploitation at each time. In exploration, the agent searches ...
Usage-based web recommendations: a reinforcement learning approach
RecSys '07: Proceedings of the 2007 ACM conference on Recommender systemsInformation overload is no longer news; the explosive growth of the Internet has made this issue increasingly serious for Web users. Users are very often overwhelmed by the huge amount of information and are faced with a big challenge to find the most ...
Comments