Παρακολούθηση
DJ Strouse
DJ Strouse
Senior Research Scientist, DeepMind
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com - Αρχική σελίδα
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
495*2019
Infobot: Transfer and exploration via the information bottleneck
A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ...
arXiv preprint arXiv:1901.10902, 2019
1602019
The deterministic information bottleneck
DJ Strouse, DJ Schwab
Neural computation 29 (6), 1611-1630, 2017
1552017
Collaborating with humans without human data
DJ Strouse, K McKee, M Botvinick, E Hughes, R Everett
Advances in Neural Information Processing Systems 34, 14502-14515, 2021
1212021
Learning to share and hide intentions using information regularization
DJ Strouse, M Kleiman-Weiner, J Tenenbaum, M Botvinick, DJ Schwab
Advances in neural information processing systems 31, 2018
662018
In-context reinforcement learning with algorithm distillation
M Laskin, L Wang, J Oh, E Parisotto, S Spencer, R Steigerwald, ...
arXiv preprint arXiv:2210.14215, 2022
592022
Semantic exploration from language abstractions and pretrained representations
A Tam, N Rabinowitz, A Lampinen, NA Roy, S Chan, DJ Strouse, J Wang, ...
Advances in neural information processing systems 35, 25377-25389, 2022
512022
The information bottleneck and geometric clustering
DJ Strouse, DJ Schwab
Neural computation 31 (3), 596-612, 2019
382019
Learning more skills through optimistic exploration
DJ Strouse, K Baumli, D Warde-Farley, V Mnih, S Hansen
arXiv preprint arXiv:2107.14226, 2021
312021
A neural architecture for designing truthful and efficient auctions
A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach
arXiv preprint arXiv:1907.05181 3 (3.6), 4, 2019
302019
Melting Pot 2.0
JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ...
arXiv preprint arXiv:2211.13746, 2022
142022
Levinson's theorem for graphs
AM Childs, DJ Strouse
Journal of mathematical physics 52 (8), 2011
132011
How dendrites affect online recognition memory
X Wu, GC Mel, DJ Strouse, BW Mel
PLoS computational biology 15 (5), e1006892, 2019
122019
Confronting reward model overoptimization with constrained rlhf
T Moskovitz, AK Singh, DJ Strouse, T Sandholm, R Salakhutdinov, ...
arXiv preprint arXiv:2310.04373, 2023
92023
Learning truthful, efficient, and welfare maximizing auction rules
A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach
arXiv preprint arXiv:1907.05181, 2019
42019
Optimization of Mutual Information in Learning: Explorations in Science
DJ Strouse
Princeton University, 2018
12018
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs
AK Singh, DJ Strouse
arXiv preprint arXiv:2402.14903, 2024
2024
Neural network architecture for efficient resource allocation
A Tacchetti, DJ Strouse, MG Abellanas, TKH Graepel, Y Bachrach
US Patent 11,250,475, 2022
2022
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–18