Learning efficiently function approximation for contextual mdp O Levy, Y Mansour arXiv preprint arXiv:2203.00995, 2022 | 7 | 2022 |
Optimism in face of a context: Regret guarantees for stochastic contextual MDP O Levy, Y Mansour Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8510-8517, 2023 | 6 | 2023 |
Eluder-based Regret for Stochastic Contextual MDPs O Levy, A Cassel, A Cohen, Y Mansour arXiv preprint arXiv:2211.14932, 2022 | 2 | 2022 |
Efficient rate optimal regret for adversarial contextual MDPs using online function approximation O Levy, A Cohen, A Cassel, Y Mansour International Conference on Machine Learning, 19287-19314, 2023 | 1 | 2023 |
Counterfactual Optimism: Rate Optimal Regret for Stochastic Contextual MDPs. O Levy, AB Cassel, A Cohen, Y Mansour CoRR, 2022 | | 2022 |