Volgen
Philipp Moritz
Philipp Moritz
Graduate Student, UC Berkeley
Geverifieerd e-mailadres voor berkeley.edu - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Trust Region Policy Optimization
J Schulman
arXiv preprint arXiv:1502.05477, 2015
86282015
High-dimensional continuous control using generalized advantage estimation
J Schulman, P Moritz, S Levine, M Jordan, P Abbeel
arXiv preprint arXiv:1506.02438, 2015
39132015
Ray: A distributed framework for emerging {AI} applications
P Moritz, R Nishihara, S Wang, A Tumanov, R Liaw, E Liang, M Elibol, ...
13th USENIX symposium on operating systems design and implementation (OSDI …, 2018
14232018
Tune: A research platform for distributed model selection and training
R Liaw, E Liang, R Nishihara, P Moritz, JE Gonzalez, I Stoica
arXiv preprint arXiv:1807.05118, 2018
10932018
RLlib: Abstractions for Distributed Reinforcement Learning
E Liang, R Liaw, P Moritz, R Nishihara, R Fox, K Goldberg, J Gonzalez, ...
International Conference on Machine Learning, 3059-3068, 2018
10082018
A linearly-convergent stochastic L-BFGS algorithm
P Moritz, R Nishihara, M Jordan
Artificial Intelligence and Statistics, 249-258, 2016
3042016
Sparknet: Training deep networks in spark
P Moritz, R Nishihara, I Stoica, MI Jordan
arXiv preprint arXiv:1511.06051, 2015
2272015
Ray rllib: A composable and scalable reinforcement learning library
E Liang, R Liaw, R Nishihara, P Moritz, R Fox, J Gonzalez, K Goldberg, ...
arXiv preprint arXiv:1712.09381, 85, 2017
1882017
Real-time machine learning: The missing pieces
R Nishihara, P Moritz, S Wang, A Tumanov, W Paul, J Schleier-Smith, ...
Proceedings of the 16th workshop on hot topics in operating systems, 106-110, 2017
822017
Lineage stash: fault tolerance off the critical path
S Wang, J Liagouris, R Nishihara, P Moritz, U Misra, A Tumanov, I Stoica
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 338-352, 2019
562019
Policy gradient search: Online planning and expert iteration without search trees
T Anthony, R Nishihara, P Moritz, T Salimans, J Schulman
arXiv preprint arXiv:1904.03646, 2019
312019
Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 641-656, 2021
272021
ESCHER: expressive scheduling with ephemeral resources
R Bhardwaj, A Tumanov, S Wang, R Liaw, P Moritz, R Nishihara, I Stoica
Proceedings of the 13th Symposium on Cloud Computing, 47-62, 2022
52022
Ray: A Distributed Execution Engine for the Machine Learning Ecosystem
PC Moritz
UC Berkeley, 2019
52019
Trust Region Policy Optimization (TRPO)
J Schulman, S Levine, P Moritz, MI Jordan, P Abbeel
CoRR abs/1502.05477, 2015
42015
Flexible Primitives for Distributed Deep Learning in Ray
Y Bulatov, R Nishihara, P Moritz, M Elibol, I Stoica, MI Jordan
SysML Conference, 2018
12018
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems.
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
CoRR, 2020
2020
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–17