Cited By
- Kang L, Liu Y, Luo Y, Yang J, Yuan H, Zhu C (2025). Approximate Policy Iteration With Deep Minimax Average Bellman Error Minimization. IEEE Transactions on Neural Networks and Learning Systems 36:2 (2288-2299). doi:10.1109/TNNLS.2023.3346992. Online publication date: Feb-2025
- Che F, Xiao C, Mei J, Dai B, Gummadi R, Ramirez O, Harris C, Mahmood A, Schuurmans D, Salakhutdinov R, Kolter Z, Heller K, Weller A, Oliver N, Scarlett J, Berkenkamp F (2024). Target networks and over-parameterization stabilize off-policy bootstrapping with function approximation. Proceedings of the 41st International Conference on Machine Learning (6372-6396). doi:10.5555/3692070.3692316. Online publication date: 21-Jul-2024
- Dong J, Yang L, Krause A, Brunskill E, Cho K, Engelhardt B, Sabato S, Scarlett J (2023). Does sparsity help in learning misspecified linear bandits? Proceedings of the 40th International Conference on Machine Learning (8317-8333). doi:10.5555/3618408.3618741. Online publication date: 23-Jul-2023