Peter Henderson
Title | Cited by | Year
Deep Reinforcement Learning that Matters
P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger
AAAI Conference on Artificial Intelligence (AAAI), 2018
Cited by: 176, Year: 2018
A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version
IV Serban, R Lowe, P Henderson, L Charlin, J Pineau
Dialogue & Discourse 9 (1), 1-49, 2018
Cited by: 97*, Year: 2018
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
R Islam, P Henderson, M Gomrokchi, D Precup
Reproducibility in Machine Learning Workshop (ICML), 2017
Cited by: 44, Year: 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
AAAI Conference on Artificial Intelligence (AAAI), 2018
Cited by: 11, Year: 2018
Underwater Multi-Robot Convoying using Visual Tracking by Detection
F Shkurti, WD Chang, P Henderson, MJ Islam, JCG Higuera, J Li, ...
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017
Cited by: 10, Year: 2017
Ethical Challenges in Data-Driven Dialogue Systems
P Henderson, K Sinha, N Angelard-Gontier, NR Ke, G Fried, R Lowe, ...
AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES), 2018
Cited by: 6, Year: 2018
An Introduction to Deep Reinforcement Learning
V François-Lavet, P Henderson, R Islam, MG Bellemare, J Pineau
Foundations and Trends® in Machine Learning 11 (3-4), 219-354, 2018
Cited by: 4, Year: 2018
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
P Henderson, T Doan, R Islam, D Meger
Bayesian Deep Learning Workshop (NeurIPS), 2017
Cited by: 4, Year: 2017
Benchmark Environments for Multitask Learning in Continuous Domains
P Henderson, WD Chang, F Shkurti, J Hansen, D Meger, G Dudek
Lifelong Learning: A Reinforcement Learning Approach Workshop (ICML), 2017
Cited by: 4, Year: 2017
Learning Robust Dialog Policies in Noisy Environments
M Fazel-Zarandi, SW Li, J Cao, J Casale, P Henderson, D Whitney, ...
Conversational AI Workshop (NeurIPS), 2017
Cited by: 3, Year: 2017
Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods
P Henderson, J Romoff, J Pineau
The 14th European Workshop on Reinforcement Learning (EWRL 2018), 2018
Cited by: 2, Year: 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
J Romoff, A Piche, P Henderson, V Francois-Lavet, J Pineau
International Conference on Learning Representations Workshop (ICLR), 2018
Cited by: 2, Year: 2018
Separating value functions across time-scales
J Romoff, P Henderson, A Touati, Y Ollivier, E Brunskill, J Pineau
arXiv preprint arXiv:1902.01883, 2019
Cited by: 1, Year: 2019
Implanted intracortical electrodes as chronic neural interfaces to the central nervous system
P Henderson
PeerJ PrePrints 3 (2167-9843), e1536, 2015
Cited by: 1, Year: 2015
Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research
P Henderson, E Brunskill
Critiquing and Correcting Trends in Machine Learning Workshop (NeurIPS), 2018
2018
The RLLChatbot: a solution to the ConvAI challenge
N Gontier, K Sinha, P Henderson, I Serban, M Noseworthy, ...
arXiv preprint arXiv:1811.02714, 2018
2018
Adversarial Gain
P Henderson, K Sinha, RN Ke, J Pineau
arXiv preprint arXiv:1811.01302, 2018
2018
Oríon: Experiment Version Control for Efficient Hyperparameter Optimization
C Tsirigotis, X Bouthillier, F Corneau-Tremblay, P Henderson, R Askari, ...
Reproducibility in Machine Learning Workshop (ICML), 2018
2018
Reproducibility and Reusability in Deep Reinforcement Learning
P Henderson
McGill University, 2018
2018
Cost Adaptation for Robust Decentralized Swarm Behaviour
P Henderson, M Vertescher, D Meger, M Coates
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
2018