Ray: A distributed framework for emerging {AI} applications P Moritz, R Nishihara, S Wang, A Tumanov, R Liaw, E Liang, M Elibol, ... 13th USENIX symposium on operating systems design and implementation (OSDI …, 2018 | 1499 | 2018 |
RLlib: Abstractions for distributed reinforcement learning E Liang, R Liaw, R Nishihara, P Moritz, R Fox, K Goldberg, J Gonzalez, ... International conference on machine learning, 3053-3062, 2018 | 1240* | 2018 |
Tune: A research platform for distributed model selection and training R Liaw, E Liang, R Nishihara, P Moritz, JE Gonzalez, I Stoica arXiv preprint arXiv:1807.05118, 2018 | 1156 | 2018 |
Benchmarks for reinforcement learning in mixed-autonomy traffic E Vinitsky, A Kreidieh, L Le Flem, N Kheterpal, K Jang, C Wu, F Wu, ... Conference on robot learning, 399-409, 2018 | 188 | 2018 |
Real-time machine learning: The missing pieces R Nishihara, P Moritz, S Wang, A Tumanov, W Paul, J Schleier-Smith, ... Proceedings of the 16th workshop on hot topics in operating systems, 106-110, 2017 | 82 | 2017 |
SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards S Krishnan, A Garg, R Liaw, B Thananjeyan, L Miller, FT Pokorny, ... The international journal of robotics research 38 (2-3), 126-145, 2019 | 68 | 2019 |
Hypersched: Dynamic resource reallocation for model development on a deadline R Liaw, R Bhardwaj, L Dunlap, Y Zou, JE Gonzalez, I Stoica, A Tumanov Proceedings of the ACM Symposium on Cloud Computing, 61-73, 2019 | 56 | 2019 |
Large batch size training of neural networks with adversarial training and second-order information Z Yao, A Gholami, D Arfeen, R Liaw, J Gonzalez, K Keutzer, M Mahoney arXiv preprint arXiv:1810.01021, 2018 | 55 | 2018 |
Hirl: Hierarchical inverse reinforcement learning for long-horizon tasks with delayed rewards S Krishnan, A Garg, R Liaw, L Miller, FT Pokorny, K Goldberg arXiv preprint arXiv:1604.06508, 2016 | 53 | 2016 |
Rubberband: cloud-based hyperparameter tuning U Misra, R Liaw, L Dunlap, R Bhardwaj, K Kandasamy, JE Gonzalez, ... Proceedings of the Sixteenth European Conference on Computer Systems, 327-342, 2021 | 30 | 2021 |
Iterative noise injection for scalable imitation learning M Laskey, J Lee, W Hsieh, R Liaw, J Mahler, R Fox, K Goldberg arXiv preprint arXiv:1703.09327, 2017 | 25 | 2017 |
Composing meta-policies for autonomous driving using hierarchical deep reinforcement learning R Liaw, S Krishnan, A Garg, D Crankshaw, JE Gonzalez, K Goldberg arXiv preprint arXiv:1711.01503, 2017 | 23 | 2017 |
Ray: A distributed framework for emerging AI applications. CoRR abs/1712.05889 (2017) P Moritz, R Nishihara, S Wang, A Tumanov, R Liaw, E Liang, W Paul, ... arXiv preprint arXiv:1712.05889, 2017 | 23 | 2017 |
SWIRL: A SequentialWindowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards S Krishnan, A Garg, R Liaw, B Thananjeyan, L Miller, FT Pokorny, ... Algorithmic Foundations of Robotics XII: Proceedings of the Twelfth Workshop …, 2020 | 14 | 2020 |
Impact: Importance weighted asynchronous architectures with clipped target networks M Luo, J Yao, R Liaw, E Liang, I Stoica arXiv preprint arXiv:1912.00167, 2019 | 12 | 2019 |
Elastic hyperparameter tuning on the cloud L Dunlap, K Kandasamy, U Misra, R Liaw, M Jordan, I Stoica, ... Proceedings of the ACM Symposium on Cloud Computing, 33-46, 2021 | 7 | 2021 |
ESCHER: expressive scheduling with ephemeral resources R Bhardwaj, A Tumanov, S Wang, R Liaw, P Moritz, R Nishihara, I Stoica Proceedings of the 13th Symposium on Cloud Computing, 47-62, 2022 | 6 | 2022 |
REVEAL 2022: Reinforcement Learning-Based Recommender Systems at Scale R Liaw, P Bailey, Y Li, M Dimakopoulou, Y Raimond Proceedings of the 16th ACM Conference on Recommender Systems, 684-685, 2022 | 3 | 2022 |
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards. CoRR abs/1604.06508 (2016) S Krishnan, A Garg, R Liaw, L Miller, FT Pokorny, K Goldberg | 3 | 2016 |