Zhewei Yao

Cited by

	All	Since 2019
Citations	6159	6117
h-index	30	30
i10-index	45	45

2100

1050

525

1575

201820192020202120222023202429 93 334 710 1347 2063 1555

Public access

View all

20 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Michael MahoneyProfessor of Statistics, UC BerkeleyVerified email at stat.berkeley.edu
Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyVerified email at berkeley.edu
Amir GholamiResearch Scientist, University of California, BerkeleyVerified email at eecs.berkeley.edu
Zhen DongPhD & Postdoc at Berkeley AI ResearchVerified email at berkeley.edu
Linjian MaResearch scientist, Meta Platforms, Inc.Verified email at meta.com
Hao TanAdobe ResearchVerified email at adobe.com
Fred RoostaUniversity of QueenslandVerified email at uq.edu.au
Jinglai LiUniversity of BirminghamVerified email at bham.ac.uk

Zhewei Yao

Snowflake

Verified email at snowflake.com - Homepage

LLM Efficient AI MLSys


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A survey of quantization methods for efficient neural network inference A Gholami, S Kim, Z Dong, Z Yao, MW Mahoney, K Keutzer Low-Power Computer Vision, 291-326, 2022	1015	2022
Q-bert: Hessian based ultra low precision quantization of bert S Shen, Z Dong, J Ye, L Ma, Z Yao, A Gholami, MW Mahoney, K Keutzer Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 8815-8821, 2020	533	2020
Hawq: Hessian aware quantization of neural networks with mixed-precision Z Dong, Z Yao, A Gholami, MW Mahoney, K Keutzer Proceedings of the IEEE/CVF international conference on computer vision, 293-302, 2019	505	2019
Zeroq: A novel zero shot quantization framework Y Cai, Z Yao, Z Dong, A Gholami, MW Mahoney, K Keutzer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	407	2020
How much can clip benefit vision-and-language tasks? S Shen, LH Li, H Tan, M Bansal, A Rohrbach, KW Chang, Z Yao, ... arXiv preprint arXiv:2107.06383, 2021	374	2021
I-bert: Integer-only bert quantization S Kim, A Gholami, Z Yao, MW Mahoney, K Keutzer International conference on machine learning, 5506-5518, 2021	292	2021
Pyhessian: Neural networks through the lens of the hessian Z Yao, A Gholami, K Keutzer, MW Mahoney 2020 IEEE international conference on big data (Big data), 581-590, 2020	268	2020
Hawq-v2: Hessian aware trace-weighted quantization of neural networks Z Dong, Z Yao, D Arfeen, A Gholami, MW Mahoney, K Keutzer Advances in neural information processing systems 33, 18518-18529, 2020	262	2020
Zeroquant: Efficient and affordable post-training quantization for large-scale transformers Z Yao, R Yazdani Aminabadi, M Zhang, X Wu, C Li, Y He Advances in Neural Information Processing Systems 35, 27168-27183, 2022	243	2022
Adahessian: An adaptive second order optimizer for machine learning Z Yao, A Gholami, S Shen, M Mustafa, K Keutzer, M Mahoney proceedings of the AAAI conference on artificial intelligence 35 (12), 10665 …, 2021	242	2021
Hawq-v3: Dyadic neural network quantization Z Yao, Z Dong, Z Zheng, A Gholami, J Yu, E Tan, L Wang, Q Huang, ... International Conference on Machine Learning, 11875-11886, 2021	232	2021
Shallow neural networks for fluid flow reconstruction with limited sensors NB Erichson, L Mathelin, Z Yao, SL Brunton, MW Mahoney, JN Kutz Proceedings of the Royal Society A 476 (2238), 20200097, 2020	205	2020
Deepspeed-moe: Advancing mixture-of-experts inference and training to power next-generation ai scale S Rajbhandari, C Li, Z Yao, M Zhang, RY Aminabadi, AA Awan, J Rasley, ... International conference on machine learning, 18332-18346, 2022	175	2022
Hessian-based analysis of large batch training and robustness to adversaries Z Yao, A Gholami, Q Lei, K Keutzer, MW Mahoney Advances in Neural Information Processing Systems 31, 2018	163	2018
ANODEV2: A coupled neural ODE framework T Zhang, Z Yao, A Gholami, JE Gonzalez, K Keutzer, MW Mahoney, ... Advances in Neural Information Processing Systems 32, 2019	97	2019
Powernorm: Rethinking batch normalization in transformers S Shen, Z Yao, A Gholami, M Mahoney, K Keutzer International conference on machine learning, 8741-8751, 2020	85	2020
On the computational inefficiency of large batch sizes for stochastic gradient descent N Golmant, N Vemuri, Z Yao, V Feinberg, A Gholami, K Rothauge, ... arXiv preprint arXiv:1811.12941, 2018	84	2018
Improving semi-supervised federated learning by reducing the gradient diversity of models Z Zhang, Y Yang, Z Yao, Y Yan, JE Gonzalez, K Ramchandran, ... 2021 IEEE International Conference on Big Data (Big Data), 1214-1225, 2021	81	2021
Trust region based adversarial attack on neural networks Z Yao, A Gholami, P Xu, K Keutzer, MW Mahoney Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	69	2019
Inexact nonconvex newton-type methods Z Yao, P Xu, F Roosta, MW Mahoney INFORMS Journal on Optimization 3 (2), 154-182, 2021	67	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors