Bilal Piot

Cited by

	All	Since 2019
Citations	15000	14152
h-index	36	33
i10-index	46	44

Citations per year

Year	Citations
2014	48
2015	43
2016	91
2017	128
2018	467
2019	842
2020	1324
2021	2455
2022	3656
2023	4464
2024	1398

Public access (based on funding mandates)

3 articles available
0 articles not available

Co-authors

Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Mohammad Gheshlaghi Azar	Cohere AI	Verified email at google.com
Zhaohan Daniel Guo	DeepMind	Verified email at google.com
Rémi Munos	DeepMind	Verified email at inria.fr
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Florent Altché	Research Engineer, DeepMind	Verified email at google.com
Jean-bastien Grill		Verified email at google.com
Florian STRUB	DeepMind	Verified email at google.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Corentin Tallec	DeepMind	Verified email at google.com
Pierre Richemond	Google DeepMind	Verified email at deepmind.com
Charles Blundell	Research Scientist at DeepMind	Verified email at google.com
Todd Hester	Waymo	Verified email at waymo.com
Pablo Sprechmann	Research Scientist at Google DeepMind	Verified email at google.com
Steven Kapturowski	DeepMind	Verified email at google.com
Mel Vecerik	DeepMind, University College London	Verified email at ucl.ac.uk
Dan Horgan	Google DeepMind	Verified email at google.com
Adrià Puigdomènech Badia	DeepMind	Verified email at google.com
Alex Vitvitskyi	DeepMind	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com

Bilal Piot

Google Deepmind

Verified email at google.com

Research interests: reinforcement learning, inverse reinforcement learning


Title	Authors	Venue	Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning	JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...	arXiv preprint arXiv:2006.07733, 2020	5828	2020
Rainbow: Combining improvements in deep reinforcement learning	M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ...	Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2484	2018
Deep q-learning from demonstrations	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...	Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1162	2018
Noisy Networks for Exploration	M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ...	arXiv preprint arXiv:1706.10295 2018, 2017	1114*	2017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards	M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...	arXiv preprint arXiv:1707.08817, 2017	741	2017
Agent57: Outperforming the atari human benchmark	AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...	International conference on machine learning, 507-517, 2020	594	2020
Never give up: Learning directed exploration strategies	AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...	arXiv preprint arXiv:2002.06038, 2020	316	2020
Acme: A research framework for distributed reinforcement learning	MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...	arXiv preprint arXiv:2006.00979, 2020	229	2020
Learning from demonstrations for real world reinforcement learning	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...	arXiv preprint arXiv:1704.03732, 2017	175	2017
Mastering the game of stratego with model-free multiagent reinforcement learning	J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ...	Science 378 (6623), 990-996, 2022	139	2022
Bootstrap latent-predictive representations for multitask reinforcement learning	ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar	International Conference on Machine Learning, 3875-3886, 2020	138	2020
Observe and look further: Achieving consistent performance on atari	T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...	arXiv preprint arXiv:1805.11593, 2018	128	2018
Inverse reinforcement learning through structured classification	E Klein, M Geist, B Piot, O Pietquin	Advances in neural information processing systems 25, 2012	119	2012
Approximate dynamic programming for two-player zero-sum markov games	J Perolat, B Scherrer, B Piot, O Pietquin	International Conference on Machine Learning, 1321-1329, 2015	113	2015
Bridging the gap between imitation learning and inverse reinforcement learning	B Piot, M Geist, O Pietquin	IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	99	2016
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning	A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos	arXiv preprint arXiv:1704.04651, 2017	97	2017
Hindsight credit assignment	A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ...	Advances in neural information processing systems 32, 2019	89	2019
Boosted bellman residual minimization handling expert demonstrations	B Piot, M Geist, O Pietquin	Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	87	2014
Byol works even without batch statistics	PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ...	arXiv preprint arXiv:2010.10241, 2020	85	2020
Neural predictive belief representations	ZD Guo, MG Azar, B Piot, BA Pires, R Munos	arXiv preprint arXiv:1811.06407, 2018	84	2018
