Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning X Lyu, Y Xiao, B Daley, C Amato Proceedings of the 20th International Conference on Autonomous Agents and …, 2021 | 142 | 2021 |
Reconciling λ-Returns with Experience Replay B Daley, C Amato Advances in Neural Information Processing Systems, 1133-1142, 2019 | 51 | 2019 |
NUPAR: A Benchmark Suite for Modern GPU Architectures Y Ukidave, FN Paravecino, L Yu, C Kalra, A Momeni, Z Chen, N Materise, ... Proceedings of the 6th ACM/SPEC International Conference on Performance …, 2015 | 45 | 2015 |
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability H Nguyen, B Daley, X Song, C Amato, R Platt Conference on Robot Learning, 2020 | 14 | 2020 |
On centralized critics in multi-agent reinforcement learning X Lyu, A Baisero, Y Xiao, B Daley, C Amato Journal of Artificial Intelligence Research 77, 295-354, 2023 | 11 | 2023 |
Asymmetric DQN for partially observable reinforcement learning A Baisero, B Daley, C Amato Uncertainty in Artificial Intelligence, 107-117, 2022 | 11 | 2022 |
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning B Daley, M White, C Amato, MC Machado International Conference on Machine Learning, 2023 | 5* | 2023 |
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning B Daley, C Hickert, C Amato Autonomous Agents and Multiagent Systems, 2021 | 4 | 2021 |
Demystifying the Recency Heuristic in Temporal-Difference Learning B Daley, MC Machado, M White Reinforcement Learning Conference, 2024 | 1 | 2024 |
Averaging -step Returns Reduces Variance in Reinforcement Learning B Daley, M White, MC Machado International Conference on Machine Learning, 2024 | 1 | 2024 |
Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties B Daley, C Amato arXiv preprint arXiv:2010.01356, 2020 | 1 | 2020 |
Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning B Daley, I Chan Reinforcement Learning and Decision Making, 2022 | | 2022 |
Virtual Replay Cache B Daley, C Amato arXiv preprint arXiv:2112.03421, 2021 | | 2021 |
Human-Level Control without Server-Grade Hardware B Daley, C Amato arXiv preprint arXiv:2111.01264, 2021 | | 2021 |
Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods B Daley, C Amato arXiv preprint arXiv:2106.05449, 2021 | | 2021 |
Gym Classics B Daley GitHub repository, 2021 | | 2021 |