Wenhao Yang

Cited by

	All	Since 2019
Citations	2333	2330
h-index	7	7
i10-index	6	6

Citations per year

Year	Citations
2019	19
2020	145
2021	370
2022	655
2023	877
2024	263

Public access

Available	4 articles
Not available	0 articles

Based on funding mandates

Co-authors

Zhihua Zhang	Professor of Computer Science, Shanghai Jiao Tong University	Verified email at zju.edu.cn
Shusen Wang	Xiaohongshu	Verified email at xiaohongshu.com
Xiang Li	University of Pennsylvania	Verified email at upenn.edu
Liangyu Zhang	PhD student at Peking University	Verified email at pku.edu.cn
Tadashi Kozuno	Omron Sinic X	Verified email at sinicx.com
Hao Jin	Peking University	Verified email at pku.edu.cn
Michael I. Jordan	Professor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley	Verified email at cs.berkeley.edu
Jiadong Liang	Peking University	Verified email at pku.edu.cn
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Mohammad Gheshlaghi Azar	Cohere AI	Verified email at google.com
Rémi Munos	DeepMind	Verified email at inria.fr
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Pierre Ménard	OvGU Magdeburg	Verified email at inria.fr
Toshinori Kitamura	The University of Tokyo	Verified email at weblab.t.u-tokyo.ac.jp
Nino Vieillard	Google DeepMind	Verified email at google.com
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Jincheng Mei	Research Scientist, Google Brain	Verified email at google.com
Yunhao Tang	Research Scientist, DeepMind	Verified email at columbia.edu
Scott M. Jordan	Postdoctoral Fellow, University of Alberta	Verified email at ualberta.ca

Wenhao Yang

Stanford University

Verified email at stanford.edu

Reinforcement Learning, Optimization, Statistics


Title	Authors	Venue	Cited by	Year
On the Convergence of FedAvg on Non-IID Data	X Li, K Huang, W Yang, S Wang, Z Zhang	arXiv preprint arXiv:1907.02189, 2019	2068	2019
Communication-efficient local decentralized SGD methods	X Li, W Yang, S Wang, Z Zhang	arXiv preprint arXiv:1910.09126, 2019	105*	2019
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics	W Yang, L Zhang, Z Zhang	The Annals of Statistics 50 (6), 3223-3248, 2022	47	2022
Federated Reinforcement Learning with Environment Heterogeneity	H Jin, Y Peng, W Yang, S Wang, Z Zhang	International Conference on Artificial Intelligence and Statistics, 18-37, 2022	43	2022
A regularized approach to sparse optimal policy in reinforcement learning	W Yang, X Li, Z Zhang	Advances in Neural Information Processing Systems 32, 2019	35*	2019
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning	X Li, W Yang, J Liang, Z Zhang, MI Jordan	International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023	14*	2023
Robust Markov Decision Processes without Model Estimation	W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang	arXiv preprint arXiv:2302.01248, 2023	7*	2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal	T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...	arXiv preprint arXiv:2205.14211, 2022	5	2022
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs	W Yang, X Li, G Xie, Z Zhang	arXiv preprint arXiv:2011.00213, 2020	3	2020
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice	T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ...	International Conference on Machine Learning, 17135-17175, 2023	2	2023
Semi-infinitely Constrained Markov Decision Processes	L Zhang, Y Peng, W Yang, Z Zhang	Advances in Neural Information Processing Systems 35, 16808-16820, 2022	2	2022
Estimation and Inference in Distributional Reinforcement Learning	L Zhang, Y Peng, J Liang, W Yang, Z Zhang	arXiv preprint arXiv:2309.17262, 2023	1	2023
Semiparametrically efficient off-policy evaluation in linear Markov decision processes	C Xie, W Yang, Z Zhang	International Conference on Machine Learning, 38227-38257, 2023	1	2023
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning	L Zhang, Y Peng, W Yang, Z Zhang	IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023		2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach	M Lu, W Yang, L Zhang, Z Zhang	arXiv preprint arXiv:2209.05186, 2022		2022
