Cited By
- Xu J, Liu B, Zhao X, Wang X (2024). Online reinforcement learning for condition-based group maintenance using factored Markov decision processes. European Journal of Operational Research 315:1 (176-190). doi:10.1016/j.ejor.2023.11.039. Online publication date: May 2024.
- Cui Q, Xiong Z, Fazel M, Du S (2022). Learning in congestion games with bandit feedback. In: Koyejo S, Mohamed S, Agarwal A, Belgrave D, Cho K, Oh A (eds.), Proceedings of the 36th International Conference on Neural Information Processing Systems (11009-11022). doi:10.5555/3600270.3601070. Online publication date: 28 Nov 2022.
- Rosenberg A, Mansour Y (2021). Oracle-efficient regret minimization in factored MDPs with unknown structure. In: Ranzato M, Beygelzimer A, Dauphin Y, Liang P, Vaughan J (eds.), Proceedings of the 35th International Conference on Neural Information Processing Systems (11148-11159). doi:10.5555/3540261.3541113. Online publication date: 6 Dec 2021.