Follow
Aurick Zhou
Aurick Zhou
Waymo
Verified email at berkeley.edu
Title
Cited by
Cited by
Year
Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor
T Haarnoja, A Zhou, P Abbeel, S Levine
International conference on machine learning, 1861-1870, 2018
46472018
Soft actor-critic algorithms and applications
T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ...
arXiv preprint arXiv:1812.05905, 2018
13992018
Conservative q-learning for offline reinforcement learning
A Kumar, A Zhou, G Tucker, S Levine
Advances in Neural Information Processing Systems 33, 1179-1191, 2020
6292020
Efficient off-policy meta-reinforcement learning via probabilistic context variables
K Rakelly, A Zhou, C Finn, S Levine, D Quillen
International conference on machine learning, 5331-5340, 2019
4322019
Learning to walk via deep reinforcement learning
T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine
arXiv preprint arXiv:1812.11103, 2018
3252018
Composable deep reinforcement learning for robotic manipulation
T Haarnoja, V Pong, A Zhou, M Dalal, P Abbeel, S Levine
2018 IEEE international conference on robotics and automation (ICRA), 6244-6251, 2018
2062018
Mural: Meta-learning uncertainty-aware rewards for outcome-driven reinforcement learning
K Li, A Gupta, A Reddy, VH Pong, A Zhou, J Yu, S Levine
International conference on machine learning, 6346-6356, 2021
182021
Amortized conditional normalized maximum likelihood: Reliable out of distribution uncertainty estimation
A Zhou, S Levine
International Conference on Machine Learning, 12803-12812, 2021
12*2021
Bayesian adaptation for covariate shift
A Zhou, S Levine
Advances in Neural Information Processing Systems 34, 914-927, 2021
11*2021
Wayformer: Motion forecasting via simple & efficient attention networks
N Nayakanti, R Al-Rfou, A Zhou, K Goel, KS Refaat, B Sapp
arXiv preprint arXiv:2207.05844, 2022
72022
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
T Haarnoja, A Zhou, P Abbeel, S Levine
Proceedings of the 35th International Conference on Machine Learning. July …, 1861
11861
The system can't perform the operation now. Try again later.
Articles 1–11