Yuhuai(Tony) Wu

Cited by

	All	Since 2019
Citations	15656	14948
h-index	34	33
i10-index	45	45

Citations per year
Year	Citations
2016	40
2017	148
2018	477
2019	816
2020	1586
2021	1989
2022	2747
2023	5119
2024	2645

Public access (based on funding mandates)
Available	17 articles
Not available	0 articles

Co-authors

Roger Grosse, Associate Professor, University of Toronto (verified email at cs.toronto.edu)
Jimmy Ba, University of Toronto (verified email at cs.toronto.edu)
Christian Szegedy, Researcher (verified email at szegedy.org)
Yoshua Bengio, Professor of computer science, University of Montreal, Mila, IVADO, CIFAR (verified email at umontreal.ca)
Ruslan Salakhutdinov, UPMC Professor, Machine Learning Department, CMU (verified email at cs.cmu.edu)
Behnam Neyshabur, Senior Staff Research Scientist, DeepMind (verified email at google.com)
David Duvenaud, Associate Professor, University of Toronto (verified email at cs.toronto.edu)
Pieter Abbeel, UC Berkeley | Covariant (verified email at cs.berkeley.edu)
Albert Q. Jiang, University of Cambridge | Mistral AI (verified email at mistral.ai)
Percy Liang, Associate Professor of Computer Science, Stanford University (verified email at cs.stanford.edu)
Saizheng Zhang
Oriol Vinyals, Research Scientist at Google DeepMind (verified email at google.com)

Yuhuai(Tony) Wu

Co-Founder of xAI

Verified email at x.ai

Machine Learning, Machine Reasoning, Theorem Proving


Title	Authors	Venue	Cited by	Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning	O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...	Nature 575 (7782), 350-354, 2019	4692*	2019
On the opportunities and risks of foundation models	R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...	arXiv preprint arXiv:2108.07258, 2021	2869	2021
OpenAI baselines	P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...		1846*	2017
PaLM 2 technical report	R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...	arXiv preprint arXiv:2305.10403, 2023	846	2023
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation	Y Wu, E Mansimov, RB Grosse, S Liao, J Ba	Advances in Neural Information Processing Systems, 5283-5292, 2017	794	2017
Holistic evaluation of language models	P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...	arXiv preprint arXiv:2211.09110, 2022	641	2022
Solving quantitative reasoning problems with language models	A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...	Advances in Neural Information Processing Systems 35, 3843-3857, 2022	433	2022
Backpropagation through the void: Optimizing control variates for black-box gradient estimation	W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud	ICLR 2018, 2017	315	2017
STaR: Bootstrapping reasoning with reasoning	E Zelikman, Y Wu, ND Goodman	arXiv preprint arXiv:2203.14465, 2022	285*	2022
On the quantitative analysis of decoder-based generative models	Y Wu, Y Burda, R Salakhutdinov, R Grosse	5th International Conference on Learning Representations (ICLR 2017), 2016	268	2016
Sticking the landing: Simple, lower-variance gradient estimators for variational inference	G Roeder, Y Wu, DK Duvenaud	Advances in Neural Information Processing Systems 30, 2017	260*	2017
Architectural complexity measures of recurrent neural networks	S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio	Advances in Neural Information Processing Systems 29, 2016	190	2016
STDP-compatible approximation of backpropagation in an energy-based model	Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu	Neural Computation 29 (3), 555-577, 2017	182*	2017
On multiplicative integration with recurrent neural networks	Y Wu, S Zhang, Y Zhang, Y Bengio, RR Salakhutdinov	Advances in Neural Information Processing Systems 29, 2016	181	2016
Memorizing Transformers	Y Wu, MN Rabe, DL Hutchins, C Szegedy	International Conference on Learning Representations 2022, 2022	170	2022
The Importance of Sampling in Meta-Reinforcement Learning	B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...	Advances in Neural Information Processing Systems, 9299-9309, 2018	165*	2018
Understanding Short-Horizon Bias in Stochastic Meta-Optimization	Y Wu, M Ren, R Liao, RB Grosse	Sixth International Conference on Learning Representations (ICLR 2018), 2018	133	2018
Invariant Causal Representation Learning for Out-of-Distribution Generalization	C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf	International Conference on Learning Representations, 2022	121*	2022
Exploring length generalization in large language models	C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ...	Advances in Neural Information Processing Systems 35, 38546-38556, 2022	110	2022
Autoformalization with large language models	Y Wu, AQ Jiang, W Li, M Rabe, C Staats, M Jamnik, C Szegedy	Advances in Neural Information Processing Systems 35, 32353-32368, 2022	100	2022

Articles 1–20
