Philipp Moritz
Title | Cited by | Year
Trust Region Policy Optimization
J Schulman, S Levine, P Moritz, MI Jordan, P Abbeel
arXiv preprint arXiv:1502.05477, 2015
Cited by: 8607 | Year: 2015
High-dimensional continuous control using generalized advantage estimation
J Schulman, P Moritz, S Levine, M Jordan, P Abbeel
arXiv preprint arXiv:1506.02438, 2015
Cited by: 3902 | Year: 2015
Ray: A distributed framework for emerging AI applications
P Moritz, R Nishihara, S Wang, A Tumanov, R Liaw, E Liang, M Elibol, ...
13th USENIX symposium on operating systems design and implementation (OSDI …, 2018
Cited by: 1417 | Year: 2018
Tune: A research platform for distributed model selection and training
R Liaw, E Liang, R Nishihara, P Moritz, JE Gonzalez, I Stoica
arXiv preprint arXiv:1807.05118, 2018
Cited by: 1084 | Year: 2018
RLlib: Abstractions for Distributed Reinforcement Learning
E Liang, R Liaw, P Moritz, R Nishihara, R Fox, K Goldberg, J Gonzalez, ...
International Conference on Machine Learning, 3059-3068, 2018
Cited by: 1001 | Year: 2018
A linearly-convergent stochastic L-BFGS algorithm
P Moritz, R Nishihara, M Jordan
Artificial Intelligence and Statistics, 249-258, 2016
Cited by: 304 | Year: 2016
Sparknet: Training deep networks in spark
P Moritz, R Nishihara, I Stoica, MI Jordan
arXiv preprint arXiv:1511.06051, 2015
Cited by: 226 | Year: 2015
Ray rllib: A composable and scalable reinforcement learning library
E Liang, R Liaw, R Nishihara, P Moritz, R Fox, J Gonzalez, K Goldberg, ...
arXiv preprint arXiv:1712.09381, 85, 2017
Cited by: 188 | Year: 2017
Real-time machine learning: The missing pieces
R Nishihara, P Moritz, S Wang, A Tumanov, W Paul, J Schleier-Smith, ...
Proceedings of the 16th workshop on hot topics in operating systems, 106-110, 2017
Cited by: 82 | Year: 2017
Lineage stash: fault tolerance off the critical path
S Wang, J Liagouris, R Nishihara, P Moritz, U Misra, A Tumanov, I Stoica
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 338-352, 2019
Cited by: 55 | Year: 2019
Policy gradient search: Online planning and expert iteration without search trees
T Anthony, R Nishihara, P Moritz, T Salimans, J Schulman
arXiv preprint arXiv:1904.03646, 2019
Cited by: 31 | Year: 2019
Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 641-656, 2021
Cited by: 26 | Year: 2021
Ray: A Distributed Execution Engine for the Machine Learning Ecosystem
PC Moritz
UC Berkeley, 2019
Cited by: 5 | Year: 2019
ESCHER: expressive scheduling with ephemeral resources
R Bhardwaj, A Tumanov, S Wang, R Liaw, P Moritz, R Nishihara, I Stoica
Proceedings of the 13th Symposium on Cloud Computing, 47-62, 2022
Cited by: 4 | Year: 2022
Trust Region Policy Optimization (TRPO)
J Schulman, S Levine, P Moritz, MI Jordan, P Abbeel
CoRR abs/1502.05477, 2015
Cited by: 4 | Year: 2015
Flexible Primitives for Distributed Deep Learning in Ray
Y Bulatov, R Nishihara, P Moritz, M Elibol, I Stoica, MI Jordan
SysML Conference, 2018
Cited by: 1 | Year: 2018
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems.
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
CoRR, 2020
Year: 2020