Abbas Abdolmaleki

Geciteerd door

	Alles	Sinds 2019
Citaties	4048	3797
h-index	27	24
i10-index	44	36

1200

600

300

900

2014201520162017201820192020202120222023202414 28 29 62 81 177 377 536 890 1104 698

Openbare toegang

Alles bekijken

12 artikelen

1 artikel

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Martin RiedmillerDeepMindGeverifieerd e-mailadres voor google.com
Nicolas HeessDeepMindGeverifieerd e-mailadres voor google.com
Michael NeunertGoogle DeepMindGeverifieerd e-mailadres voor google.com
Luis Paulo ReisAssociate Professor, University of PortoGeverifieerd e-mailadres voor fe.up.pt
Nuno LauUniversidade de AveiroGeverifieerd e-mailadres voor ua.pt
Thomas LampeDeepMindGeverifieerd e-mailadres voor google.com
Yuval TassaSenior Research Scientist, Google DeepMindGeverifieerd e-mailadres voor google.com
Roland HafnerDeepMindGeverifieerd e-mailadres voor google.com
Gerhard NeumannProfessor, Karlsruhe Institute of Technology (KIT)Geverifieerd e-mailadres voor robot-learning.de
Noah Y. SiegelGoogle DeepMindGeverifieerd e-mailadres voor google.com
Josh MerelGeverifieerd e-mailadres voor google.com
Steven BohezGoogle DeepMindGeverifieerd e-mailadres voor google.com
Nima ShafiiNVIDIAGeverifieerd e-mailadres voor nvidia.com
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIGeverifieerd e-mailadres voor ias.tu-darmstadt.de
Rudolf LioutikovTT-Professor, Intuitive Robots Lab, Karlsruhe Institute of TechnologyGeverifieerd e-mailadres voor kit.edu
Jost Tobias SpringenbergGoogle DeepMind

Volgen

Abbas Abdolmaleki

Deepmind

Geverifieerd e-mailadres voor google.com

Artificial Intelligence Reinforcement Learning Robotics


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022	707	2022
Deepmind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018	586	2018
Maximum a posteriori policy optimisation A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018	505	2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020	291	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	239	2020
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019	122	2019
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	112	2022
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019	112	2019
Model-based relative entropy stochastic search A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann Advances in Neural Information Processing Systems 28, 2015	93	2015
Continuous-discrete reinforcement learning for hybrid control in robotics M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ... Conference on Robot Learning, 735-751, 2020	90	2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ... 5th Annual Conference on Robot Learning, 2021	83	2021
A distributional view on multi-objective policy optimization A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International conference on machine learning, 11-22, 2020	77	2020
Value constrained model-free continuous control S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell arXiv preprint arXiv:1902.04623, 2019	70	2019
Relative entropy regularized policy iteration A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018	70	2018
Robocat: A self-improving foundation agent for robotic manipulation K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ... arXiv preprint arXiv:2306.11706, 2023	55	2023
Model-free trajectory optimization for reinforcement learning R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki International Conference on Machine Learning, 2961-2970, 2016	49	2016
Data-efficient hindsight off-policy option learning M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ... International Conference on Machine Learning, 11340-11350, 2021	46	2021
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020	43	2020
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing N Shafii, A Khorsandian, A Abdolmaleki, B Jozi 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009	41	2009
Deriving and improving cma-es with information geometric trust regions A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017	40	2017

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs