Follow
DJ Strouse
DJ Strouse
Research Scientist, DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
5562019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
3582024
Infobot: Transfer and exploration via the information bottleneck
A Goyal, R Islam, DJ Strouse, Z Ahmed, H Larochelle, M Botvinick, ...
International Conference on Learning Representations, 2018
1692018
The deterministic information bottleneck
DJ Strouse, DJ Schwab
Neural computation 29 (6), 1611-1630, 2017
1662017
Collaborating with humans without human data
DJ Strouse, K McKee, M Botvinick, E Hughes, R Everett
Advances in Neural Information Processing Systems 34, 14502-14515, 2021
1602021
In-context reinforcement learning with algorithm distillation
M Laskin, L Wang, J Oh, E Parisotto, S Spencer, R Steigerwald, ...
arXiv preprint arXiv:2210.14215, 2022
852022
Learning to share and hide intentions using information regularization
DJ Strouse, M Kleiman-Weiner, J Tenenbaum, M Botvinick, DJ Schwab
Advances in neural information processing systems 31, 2018
702018
Semantic exploration from language abstractions and pretrained representations
A Tam, N Rabinowitz, A Lampinen, NA Roy, S Chan, DJ Strouse, J Wang, ...
Advances in neural information processing systems 35, 25377-25389, 2022
622022
Learning more skills through optimistic exploration
DJ Strouse, K Baumli, D Warde-Farley, V Mnih, S Hansen
arXiv preprint arXiv:2107.14226, 2021
422021
The information bottleneck and geometric clustering
DJ Strouse, DJ Schwab
Neural computation 31 (3), 596-612, 2019
412019
A neural architecture for designing truthful and efficient auctions
A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach
arXiv preprint arXiv:1907.05181 3 (3.6), 4, 2019
342019
Melting Pot 2.0
JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ...
arXiv preprint arXiv:2211.13746, 2022
282022
Confronting reward model overoptimization with constrained rlhf
T Moskovitz, AK Singh, DJ Strouse, T Sandholm, R Salakhutdinov, ...
arXiv preprint arXiv:2310.04373, 2023
222023
Levinson's theorem for graphs
AM Childs, DJ Strouse
Journal of mathematical physics 52 (8), 2011
152011
How dendrites affect online recognition memory
X Wu, GC Mel, DJ Strouse, BW Mel
PLoS computational biology 15 (5), e1006892, 2019
142019
Tokenization counts: the impact of tokenization on arithmetic in frontier llms
AK Singh, DJ Strouse
arXiv preprint arXiv:2402.14903, 2024
132024
Learning truthful, efficient, and welfare maximizing auction rules
A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach
arXiv preprint arXiv:1907.05181, 2019
52019
Optimization of Mutual Information in Learning: Explorations in Science
DJ Strouse
Princeton University, 2018
12018
REINFORCEMENT LEARNING USING AN ENSEMBLE OF DISCRIMINATOR MODELS
SS Hansen, DJ Strouse
US Patent App. 18/281,711, 2024
2024
Neural network architecture for efficient resource allocation
A Tacchetti, DJ Strouse, MG Abellanas, TKH Graepel, Y Bachrach
US Patent 11,250,475, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20