Qiyang Li
Title
Cited by
Year
Offline reinforcement learning as one big sequence modeling problem
M Janner, Q Li, S Levine
Advances in neural information processing systems 34, 1273-1286, 2021
Cited by: 779; Year: 2021
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) pipeline for musical timbre transfer
S Huang, Q Li, C Anil, X Bao, S Oore, RB Grosse
arXiv preprint arXiv:1811.09620, 2018
Cited by: 142; Year: 2018
Preventing gradient attenuation in Lipschitz constrained convolutional networks
Q Li, S Haque, C Anil, J Lucas, RB Grosse, JH Jacobsen
Advances in neural information processing systems 32, 2019
Cited by: 115; Year: 2019
Deep neural networks for improved, impromptu trajectory tracking of quadrotors
Q Li, J Qian, Z Zhu, X Bao, MK Helwa, AP Schoellig
2017 IEEE International Conference on Robotics and Automation (ICRA), 5183-5189, 2017
Cited by: 113; Year: 2017
OpenEQA: Embodied question answering in the era of foundation models
A Majumdar, A Ajay, X Zhang, P Putta, S Yenamandra, M Henaff, S Silwal, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 87; Year: 2024
Learning Visuotactile Skills with Two Multifingered Hands
T Lin, Y Zhang, Q Li, H Qi, B Yi, S Levine, J Malik
arXiv preprint arXiv:2404.16823, 2024
Cited by: 33; Year: 2024
Efficient deep reinforcement learning requires regulating overfitting
Q Li, A Kumar, I Kostrikov, S Levine
arXiv preprint arXiv:2304.10466, 2023
Cited by: 33; Year: 2023
Building a winning self-driving car in six months
K Burnett, A Schimpe, S Samavi, M Gridseth, CW Liu, Q Li, Z Kroeze, ...
2019 International Conference on Robotics and Automation (ICRA), 9583-9589, 2019
Cited by: 23; Year: 2019
Understanding the complexity gains of single-task RL with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
Cited by: 15; Year: 2023
Learning of coordination policies for robotic swarms
Q Li, X Du, Y Huang, Q Sykora, AP Schoellig
arXiv preprint arXiv:1709.06620, 2017
Cited by: 10; Year: 2017
Accelerating exploration with unlabeled prior data
Q Li, J Zhang, D Ghosh, A Zhang, S Levine
Advances in Neural Information Processing Systems 36, 67434-67458, 2023
Cited by: 9; Year: 2023
REFACTOR: Learning to Extract Theorems from Proofs
JP Zhou, Y Wu, Q Li, R Grosse
arXiv preprint arXiv:2402.17032, 2024
Cited by: 6; Year: 2024
AdaCat: Adaptive categorical discretization for autoregressive models
Q Li, A Jain, P Abbeel
Uncertainty in Artificial Intelligence, 1188-1198, 2022
Cited by: 2; Year: 2022
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
C Xu, Q Li, J Luo, S Levine
arXiv preprint arXiv:2412.09858, 2024
Year: 2024
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Z Zhou, A Peng, Q Li, S Levine, A Kumar
arXiv preprint arXiv:2412.07762, 2024
Year: 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
M Wilcoxson, Q Li, K Frans, S Levine
arXiv preprint arXiv:2410.18076, 2024
Year: 2024
R-LAtte: Attention Module for Visual Control via Reinforcement Learning
M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel