Abbas Abdolmaleki
DeepMind
Verified email at google.com
Title
Cited by
Year
Magnetic control of tokamak plasmas through deep reinforcement learning
J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ...
Nature 602 (7897), 414-419, 2022
Cited by: 877, Year: 2022
Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
Cited by: 647, Year: 2018
Maximum a posteriori policy optimisation
A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ...
arXiv preprint arXiv:1806.06920, 2018
Cited by: 536, Year: 2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning
NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ...
arXiv preprint arXiv:2002.08396, 2020
Cited by: 314, Year: 2020
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
Cited by: 266, Year: 2020
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ...
arXiv preprint arXiv:1906.07516, 2019
Cited by: 131, Year: 2019
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ...
arXiv preprint arXiv:1909.12238, 2019
Cited by: 122, Year: 2019
From motor control to team play in simulated humanoid football
S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ...
Science Robotics 7 (69), eabo0235, 2022
Cited by: 119, Year: 2022
Continuous-discrete reinforcement learning for hybrid control in robotics
M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ...
Conference on Robot Learning, 735-751, 2020
Cited by: 104, Year: 2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes
AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ...
5th Annual Conference on Robot Learning, 2021
Cited by: 103, Year: 2021
Model-based relative entropy stochastic search
A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Paulo Reis, G Neumann
Advances in Neural Information Processing Systems 28, 2015
Cited by: 98, Year: 2015
A distributional view on multi-objective policy optimization
A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ...
International conference on machine learning, 11-22, 2020
Cited by: 86, Year: 2020
Robocat: A self-improving foundation agent for robotic manipulation
K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ...
arXiv preprint arXiv:2306.11706, 2023
Cited by: 85, Year: 2023
Value constrained model-free continuous control
S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell
arXiv preprint arXiv:1902.04623, 2019
Cited by: 72, Year: 2019
Relative entropy regularized policy iteration
A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ...
arXiv preprint arXiv:1812.02256, 2018
Cited by: 71, Year: 2018
Model-free trajectory optimization for reinforcement learning
R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki
International Conference on Machine Learning, 2961-2970, 2016
Cited by: 52, Year: 2016
Data-efficient hindsight off-policy option learning
M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ...
International Conference on Machine Learning, 11340-11350, 2021
Cited by: 51, Year: 2021
Imagined value gradients: Model-based policy optimization with transferable latent dynamics models
A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ...
Conference on Robot Learning, 566-589, 2020
Cited by: 47, Year: 2020
Deriving and improving cma-es with information geometric trust regions
A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann
Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017
Cited by: 41, Year: 2017
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing
N Shafii, A Khorsandian, A Abdolmaleki, B Jozi
2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009
Cited by: 41, Year: 2009
Articles 1–20