Follow
Dale Schuurmans
Dale Schuurmans
University of Alberta, Google DeepMind
Verified email at cs.ualberta.ca - Homepage
Title
Cited by
Cited by
Year
Chain of thought prompting elicits reasoning in large language models
J Wei, X Wang, D Schuurmans, M Bosma, E Chi, Q Le, D Zhou
Advances in Neural Information Processing Systems 35, 2022
54552022
Self-consistency improves chain of thought reasoning in language models
X Wang, J Wei, D Schuurmans, Q Le, E Chi, S Narang, A Chowdhery, ...
arXiv preprint arXiv:2203.11171, 2022
1360*2022
Least-to-most prompting enables complex reasoning in large language models
D Zhou, N Schärli, L Hou, J Wei, N Scales, X Wang, D Schuurmans, C Cui, ...
arXiv preprint arXiv:2205.10625, 2022
7172022
Maximum margin clustering
L Xu, J Neufeld, B Larson, D Schuurmans
Advances in neural information processing systems 17, 2004
6292004
An optimistic perspective on offline reinforcement learning
R Agarwal, D Schuurmans, M Norouzi
International Conference on Machine Learning, 104-114, 2020
604*2020
Bridging the gap between value and policy based reinforcement learning
O Nachum, M Norouzi, K Xu, D Schuurmans
Advances in neural information processing systems 30, 2017
4982017
Advances in Large-Margin Classifiers
PJ Bartlett, B Schölkopf, D Schuurmans, AJ Smola
MIT Press 155, 156, 2000
489*2000
Learning with a Strong Adversary
R Huang, B Xu, D Schuurmans, C Szepesvari
https://arxiv.org/abs/1511.03034, 2015
4402015
Automatic Gait Optimization With Gaussian Process Regression.
DJ Lizotte, T Wang, MH Bowling, D Schuurmans
IJCAI 7, 944-949, 2007
3942007
Augmenting naive bayes classifiers with statistical language models
F Peng, D Schuurmans, S Wang
Information Retrieval 7, 317-345, 2004
3882004
Boosting in the limit: Maximizing the margin of learned ensembles
AJ Grove, D Schuurmans
AAAI/IAAI, 692-699, 1998
3801998
Discriminative batch mode active learning
Y Guo, D Schuurmans
Advances in neural information processing systems 20, 2007
3772007
Combining Naive Bayes and n-Gram Language Models for Text Classification
F Peng, D Schuurmans
European Conference on Information Retrieval, 335-350, 2003
2752003
Language independent authorship attribution with character level n-grams
F Peng, D Schuurmans, V Keselj, S Wang
10th Conference of the European Chapter of the Association for Computational …, 2003
275*2003
On the global convergence rates of softmax policy gradient methods
J Mei, C Xiao, C Szepesvari, D Schuurmans
International conference on machine learning, 6820-6829, 2020
2742020
What learning algorithm is in-context learning? investigations with linear models
E Akyürek, D Schuurmans, J Andreas, T Ma, D Zhou
arXiv preprint arXiv:2211.15661, 2022
2642022
Reward augmented maximum likelihood for neural structured prediction
M Norouzi, S Bengio, N Jaitly, M Schuster, Y Wu, D Schuurmans
Advances In Neural Information Processing Systems 29, 2016
2482016
Algaedice: Policy gradient from arbitrary experience
O Nachum, B Dai, I Kostrikov, Y Chow, L Li, D Schuurmans
arXiv preprint arXiv:1912.02074, 2019
2292019
Understanding the impact of entropy on policy optimization
Z Ahmed, N Le Roux, M Norouzi, D Schuurmans
International conference on machine learning, 151-160, 2019
2262019
Unsupervised and semi-supervised multi-class support vector machines
L Xu, D Schuurmans
AAAI 40, 50, 2005
2192005
The system can't perform the operation now. Try again later.
Articles 1–20