Brett Daley
Title
Cited by
Year
Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
X Lyu, Y Xiao, B Daley, C Amato
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
Cited by: 142; Year: 2021
Reconciling λ-Returns with Experience Replay
B Daley, C Amato
Advances in Neural Information Processing Systems, 1133-1142, 2019
Cited by: 51; Year: 2019
NUPAR: A Benchmark Suite for Modern GPU Architectures
Y Ukidave, FN Paravecino, L Yu, C Kalra, A Momeni, Z Chen, N Materise, ...
Proceedings of the 6th ACM/SPEC International Conference on Performance …, 2015
Cited by: 45; Year: 2015
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
H Nguyen, B Daley, X Song, C Amato, R Platt
Conference on Robot Learning, 2020
Cited by: 14; Year: 2020
On centralized critics in multi-agent reinforcement learning
X Lyu, A Baisero, Y Xiao, B Daley, C Amato
Journal of Artificial Intelligence Research 77, 295-354, 2023
Cited by: 11; Year: 2023
Asymmetric DQN for partially observable reinforcement learning
A Baisero, B Daley, C Amato
Uncertainty in Artificial Intelligence, 107-117, 2022
Cited by: 11; Year: 2022
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
B Daley, M White, C Amato, MC Machado
International Conference on Machine Learning, 2023
Cited by: 5*; Year: 2023
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
B Daley, C Hickert, C Amato
Autonomous Agents and Multiagent Systems, 2021
Cited by: 4; Year: 2021
Demystifying the Recency Heuristic in Temporal-Difference Learning
B Daley, MC Machado, M White
Reinforcement Learning Conference, 2024
Cited by: 1; Year: 2024
Averaging n-step Returns Reduces Variance in Reinforcement Learning
B Daley, M White, MC Machado
International Conference on Machine Learning, 2024
Cited by: 1; Year: 2024
Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties
B Daley, C Amato
arXiv preprint arXiv:2010.01356, 2020
Cited by: 1; Year: 2020
Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning
B Daley, I Chan
Reinforcement Learning and Decision Making, 2022
2022
Virtual Replay Cache
B Daley, C Amato
arXiv preprint arXiv:2112.03421, 2021
2021
Human-Level Control without Server-Grade Hardware
B Daley, C Amato
arXiv preprint arXiv:2111.01264, 2021
2021
Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
B Daley, C Amato
arXiv preprint arXiv:2106.05449, 2021
2021
Gym Classics
B Daley
GitHub repository, 2021
2021