William Fedus

Cited by

	All	Since 2019
Citations	16784	16342
h-index	33	32
i10-index	38	38

8000

4000

2000

6000

2018201920202021202220232024142 379 643 1186 2348 7530 4165

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Hugo LarochelleGoogle DeepMind & MilaVerified email at google.com
Andrew DaiGoogle DeepMindVerified email at google.com
Ian GoodfellowDeepMindVerified email at deepmind.com
Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaVerified email at cs.mcgill.ca
Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
Sergey LevineUC Berkeley, Physical IntelligenceVerified email at eecs.berkeley.edu
Emmanuel BengioMcGill UniversityVerified email at mail.mcgill.ca
Valentin ThomasPhD student, Mila, University of MontrealVerified email at umontreal.ca
David A. MeyerUniversity of California, San DiegoVerified email at math.ucsd.edu

William Fedus

OpenAI

Verified email at openai.com - Homepage

Artificial Intelligence Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (240), 1-113, 2023	3376	2023
Deep Graph Infomax. P Velickovic, W Fedus, WL Hamilton, P Liò, Y Bengio, RD Hjelm ICLR (Poster) 2 (3), 4, 2019	1968	2019
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024	1541	2024
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... arXiv preprint arXiv:2206.07682, 2022	1397	2022
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	1336*	2023
Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity W Fedus, B Zoph, N Shazeer Journal of Machine Learning Research 23 (120), 1-39, 2022	1298	2022
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	729	2022
Deep graph infomax P Veličković, W Fedus, WL Hamilton, P Liò, Y Bengio, RD Hjelm arXiv preprint arXiv:1809.10341, 2018	630	2018
MaskGAN: Better Text Generation via Filling in the ______ W Fedus, I Goodfellow, AM Dai International Conference on Learning Representations (ICLR 2018), 2018	597	2018
In silico labeling: Predicting fluorescent labels in unlabeled images SF Eric Christiansen, Samuel J. Yang, D. Michael Ando, Ashkan Javaherian ... Cell, 2018	581	2018
Chatgpt: Optimizing language models for dialogue J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... OpenAI blog 2, 4, 2022	337	2022
Glam: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022	331	2022
Revisiting resnets: Improved training and scaling strategies I Bello, W Fedus, X Du, ED Cubuk, A Srinivas, TY Lin, J Shlens, B Zoph Advances in Neural Information Processing Systems 34, 22614-22627, 2021	296	2021
Revisiting fundamentals of experience replay W Fedus, P Ramachandran, R Agarwal, Y Bengio, H Larochelle, ... International Conference on Machine Learning, 3061-3071, 2020	258	2020
Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step W Fedus, M Rosca, B Lakshminarayanan, AM Dai, S Mohamed, ... International Conference on Learning Representations (ICLR 2018), 2017	249	2017
The case for a directional dark matter detector and the status of current experimental efforts S Ahlen, N Afshordi, JBR Battat, J Billard, N Bozorgnia, S Burgos, ... International Journal of Modern Physics A 25 (01), 1-51, 2010	245	2010
Language GANs Falling Short M Caccia, L Caccia, W Fedus, H Larochelle, J Pineau, L Charlin International Conference on Learning Representations (ICLR 2020), 2018	227	2018
Toju Duke, Lucas Dixon, Kun Zhang, Quoc V N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... Le, Yonghui Wu, Zhifeng Chen, and Claire Cui, 2021	123	2021
First dark matter search results from a surface run of the 10-L DMTPC directional dark matter detector S Ahlen, JBR Battat, T Caldwell, C Deaconu, D Dujmic, W Fedus, P Fisher, ... Physics Letters B 695 (1-4), 124-129, 2011	112	2011
Hyperbolic discounting and learning over multiple horizons W Fedus, C Gelada, Y Bengio, MG Bellemare, H Larochelle Reinforcement Learning and Decision Making (RLDM 2019), 2019	109	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors