Understanding deep learning (still) requires rethinking generalization C Zhang, S Bengio, M Hardt, B Recht, O Vinyals Communications of the ACM 64 (3), 107-115, 2021 | 7617 | 2021 |
Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems T Chen, M Li, Y Li, M Lin, N Wang, M Wang, T Xiao, B Xu, C Zhang, ... arXiv preprint arXiv:1512.01274, 2015 | 2877 | 2015 |
Transfusion: Understanding transfer learning for medical imaging M Raghu, C Zhang, J Kleinberg, S Bengio Advances in Neural Information Processing Systems, 2019 | 1371 | 2019 |
Unsupervised feature selection for multi-cluster data D Cai, C Zhang, X He Proceedings of the 16th ACM SIGKDD international conference on Knowledge …, 2010 | 1365 | 2010 |
Do vision transformers see like convolutional neural networks? M Raghu, T Unterthiner, S Kornblith, C Zhang, A Dosovitskiy Advances in neural information processing systems 34, 12116-12128, 2021 | 1096 | 2021 |
Training deep nets with sublinear memory cost T Chen, B Xu, C Zhang, C Guestrin arXiv preprint arXiv:1604.06174, 2016 | 1065 | 2016 |
Learning with a Wasserstein loss C Frogner, C Zhang, H Mobahi, M Araya, TA Poggio Advances in neural information processing systems 28, 2015 | 716 | 2015 |
Machine theory of mind N Rabinowitz, F Perbet, F Song, C Zhang, SMA Eslami, M Botvinick International conference on machine learning, 4218-4227, 2018 | 653 | 2018 |
Quantifying memorization across neural language models N Carlini, D Ippolito, M Jagielski, K Lee, F Tramer, C Zhang arXiv preprint arXiv:2202.07646, 2022 | 620 | 2022 |
What is being transferred in transfer learning? B Neyshabur, H Sedghi, C Zhang Advances in Neural Information Processing Systems, 2020 | 540 | 2020 |
Deduplicating training data makes language models better K Lee, D Ippolito, A Nystrom, C Zhang, D Eck, C Callison-Burch, N Carlini arXiv preprint arXiv:2107.06499, 2021 | 538 | 2021 |
A study on overfitting in deep reinforcement learning C Zhang, O Vinyals, R Munos, S Bengio arXiv preprint arXiv:1804.06893, 2018 | 493 | 2018 |
What neural networks memorize and why: Discovering the long tail via influence estimation V Feldman, C Zhang Advances in Neural Information Processing Systems, Spotlight, 2020 | 448 | 2020 |
Automated fault detection without seismic processing M Araya-Polo, T Dahlke, C Frogner, C Zhang, T Poggio, D Hohl The Leading Edge 36 (3), 208-214, 2017 | 298 | 2017 |
Are all layers created equal? C Zhang, S Bengio, Y Singer Journal of Machine Learning Research 23 (67), 1-28, 2022 | 182 | 2022 |
Deep learning with label differential privacy B Ghazi, N Golowich, R Kumar, P Manurangsi, C Zhang Advances in neural information processing systems 34, 27131-27145, 2021 | 163 | 2021 |
Counterfactual memorization in neural language models C Zhang, D Ippolito, K Lee, M Jagielski, F Tramèr, N Carlini Advances in Neural Information Processing Systems 36, 39321-39362, 2023 | 135 | 2023 |
Preventing verbatim memorization in language models gives a false sense of privacy D Ippolito, F Tramèr, M Nasr, C Zhang, M Jagielski, K Lee, ... arXiv preprint arXiv:2210.17546, 2022 | 131 | 2022 |
Theory of deep learning IIb: Optimization properties of SGD C Zhang, Q Liao, A Rakhlin, B Miranda, N Golowich, T Poggio arXiv preprint arXiv:1801.02254, 2018 | 122* | 2018 |
Identity crisis: Memorization and generalization under extreme overparameterization C Zhang, S Bengio, M Hardt, MC Mozer, Y Singer The International Conference on Learning Representations, 2020 | 112 | 2020 |