Suivre
Odalric-Ambrym Maillard
Odalric-Ambrym Maillard
Inria Lille - Nord Europe
Adresse e-mail validée de inria.fr - Page d'accueil
Titre
Citée par
Citée par
Année
Kullback-Leibler upper confidence bounds for optimal sequential allocation
O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz
The Annals of Statistics, 1516-1541, 2013
4412013
Concentration inequalities for sampling without replacement
R Bardenet, OA Maillard
2062015
CATS, a low pressure multiwire proportionnal chamber for secondary beam tracking at GANIL
S Ottini-Hustache, C Mazur, F Auger, A Musumarra, N Alamanos, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 1999
1731999
A finite-time analysis of multi-armed bandits problems with kullback-leibler divergences
OA Maillard, R Munos, G Stoltz
Proceedings of the 24th annual Conference On Learning Theory, 497-514, 2011
1672011
Compressed least-squares regression
OA Maillard, R Munos
Advances in Neural Information Processing Systems, 2009
1362009
Latent Bandits.
OA Maillard, S Mannor
International Conference on Machine Learning, 136-144, 2014
1112014
The non-stationary stochastic multi-armed bandit problem
R Allesiardo, R Féraud, OA Maillard
International Journal of Data Science and Analytics 3, 267-283, 2017
982017
Variance-aware regret bounds for undiscounted reinforcement learning in mdps
MS Talebi, OA Maillard
Algorithmic Learning Theory, 770-805, 2018
832018
Robust risk-averse stochastic multi-armed bandits
OA Maillard
Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013
772013
LSTD with random projections
M Ghavamzadeh, A Lazaric, OA Maillard, R Munos
Advances in Neural Information Processing Systems 23, 721--729, 2010
752010
PICOSEC: Charged particle timing at sub-25 picosecond precision with a Micromegas based detector
J Bortfeldt, F Brunbauer, C David, D Desforge, G Fanourakis, J Franchi, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2018
682018
Sub-sampling for multi-armed bandits
A Baransi, OA Maillard, S Mannor
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014
672014
How hard is my MDP?" The distribution-norm to the rescue"
OA Maillard, TA Mann, S Mannor
Advances in Neural Information Processing Systems 27, 2014
612014
Linear regression with random projections
O Maillard, R Munos
Journal of Machine Learning Research 13 (1), 2735-2772, 2012
612012
Online learning in adversarial lipschitz environments
OA Maillard, R Munos
Joint european conference on machine learning and knowledge discovery in …, 2010
562010
Selecting the state-representation in reinforcement learning
OA Maillard, D Ryabko, R Munos
Advances in Neural Information Processing Systems 24, 2011
502011
Finite-sample analysis of Bellman residual minimization
OA Maillard, R Munos, A Lazaric, M Ghavamzadeh
Proceedings of 2nd Asian Conference on Machine Learning, 299-314, 2010
492010
Reinforcement learning for crop management support: Review, prospects and challenges
R Gautron, OA Maillard, P Preux, M Corbeels, R Sabbadin
Computers and Electronics in Agriculture 200, 107182, 2022
472022
Tightening exploration in upper confidence reinforcement learning
H Bourel, O Maillard, MS Talebi
International Conference on Machine Learning, 1056-1066, 2020
452020
Optimal thompson sampling strategies for support-aware cvar bandits
D Baudry, R Gautron, E Kaufmann, O Maillard
International Conference on Machine Learning, 716-726, 2021
412021
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20