Volgen
AdriÓ PuigdomŔnech Badia
AdriÓ PuigdomŔnech Badia
DeepMind
Geverifieerd e-mailadres voor google.com
Titel
Geciteerd door
Geciteerd door
Jaar
Asynchronous methods for deep reinforcement learning
V Mnih, A Puigdomenech Badia, M Mirza, A Graves, T Lillicrap, T Harley, ...
International conference on machine learning, 1928-1937, 2016
74402016
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
15062016
Imagination-augmented agents for deep reinforcement learning
S RacaniŔre, T Weber, D Reichert, L Buesing, A Guez, ...
Advances in neural information processing systems 30, 2017
533*2017
Agent57: Outperforming the atari human benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International Conference on Machine Learning, 507-517, 2020
3152020
Neural episodic control
A Pritzel, B Uria, S Srinivasan, AP Badia, O Vinyals, D Hassabis, ...
International Conference on Machine Learning, 2827-2836, 2017
2812017
Never give up: Learning directed exploration strategies
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
1382020
Proceedings of the 33rd International Conference on Machine Learning
V Mnih, AP Badia, M Mirza, A Graves, T Lillicrap, T Harley, D Silver, ...
PMLR 48, 1928-1937, 2016
842016
Memory-based parameter adaptation
P Sprechmann, SM Jayakumar, JW Rae, A Pritzel, AP Badia, B Uria, ...
arXiv preprint arXiv:1802.10542, 2018
742018
Generalization of reinforcement learners with working and episodic memory
M Fortunato, M Tan, R Faulkner, S Hansen, A PuigdomŔnech Badia, ...
Advances in neural information processing systems 32, 2019
372019
Memo: A deep network for flexible combination of episodic memories
A Banino, AP Badia, R K÷ster, MJ Chadwick, V Zambaldi, D Hassabis, ...
arXiv preprint arXiv:2001.10913, 2020
262020
Asynchronous deep reinforcement learning
V Mnih, AP Badia, AB Graves, TJA Harley, D Silver, K Kavukcuoglu
US Patent 10,936,946, 2021
112021
Beyond fine-tuning: Transferring behavior in reinforcement learning
V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ...
arXiv preprint arXiv:2102.13515, 2021
52021
Coverage as a principle for discovering transferable behavior in reinforcement learning
V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...
52020
Agent57: Outperforming the Atari Human Benchmark. arXiv e-prints, page
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ...
arXiv preprint arXiv:2003.13350, 2020
32020
Retrieval-augmented reinforcement learning
A Goyal, A Friesen, A Banino, T Weber, NR Ke, AP Badia, A Guez, ...
International Conference on Machine Learning, 7740-7765, 2022
22022
The CLRS Algorithmic Reasoning Benchmark.
P Velickovic, AP Badia, D Budden, R Pascanu, A Banino, M Dashevskiy, ...
CoRR, 2022
22022
Asynchronous deep reinforcement learning
V Mnih, AP Badia, AB Graves, TJA Harley, D Silver, K Kavukcuoglu
US Patent 11,334,792, 2022
12022
Jointly learning exploratory and non-exploratory action selection policies
AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent App. 16/881,180, 2020
12020
Human-level Atari 200x faster
S Kapturowski, V Campos, R Jiang, N Rakićević, H van Hasselt, ...
arXiv preprint arXiv:2209.07550, 2022
2022
Reinforcement learning using baseline and policy neural networks
V Mnih, AP Badia, AB Graves, TJA Harley, D Silver, K Kavukcuoglu
US Patent App. 17/733,594, 2022
2022
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20