John Aslanides

Cited by

	All	Since 2019
Citations	4242	4189
h-index	15	14
i10-index	15	15

2000

1000

500

1500

2017201820192020202120222023202413 16 89 177 259 536 1157 1963

Co-authors

Nat McAleeseOpenAIVerified email at openai.com
Ian OsbandOpenAIVerified email at openai.com
Geoffrey IrvingUK AI Safety Institute (AISI)Verified email at naml.us
H Francis SongDeepMindVerified email at google.com
Benjamin Van RoyStanford UniversityVerified email at stanford.edu
Ethan PerezAnthropic; New York UniversityVerified email at anthropic.com
Tor LattimoreDeepMindVerified email at google.com
David SilverDeepMind, UCLVerified email at google.com
Yotam DoronDeepMindVerified email at google.com
Richard S. SuttonKeen, Amii, and University of AlbertaVerified email at richsutton.com
Eren SezenerDeepMindVerified email at google.com
Craig M SavageProfessor of Physics, Australian National UniversityVerified email at anu.edu.au
Silvia ChiappaSenior Staff Research Scientist, Google DeepMind; Honorary Professor, UCLVerified email at google.com
Jan LeikeOpenAIVerified email at openai.com
Marcus HutterResearcher@DeepMind & Professor at ANUVerified email at anu.edu.au
Nando de FreitasCIFAR & DeepMindVerified email at google.com
Bobak ShahriariDeepMindVerified email at google.com

John Aslanides

DeepMind

Verified email at google.com - Homepage

Machine Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Scaling Language Models: Methods, Analysis & Insights from Training Gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	862	2021
Randomized Prior Functions for Deep Reinforcement Learning I Osband, J Aslanides, A Cassirer Neural Information Processing Systems 32, 2018	416	2018
Red Teaming Language Models with Language Models E Perez, S Huang, F Song, T Cai, R Ring, J Aslanides, A Glaese, ... arXiv preprint arXiv:2202.03286, 2022	376	2022
Improving alignment of dialogue agents via targeted human judgements A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022	355	2022
Acme: A Research Framework for Distributed Reinforcement Learning M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ... arXiv preprint arXiv:2006.00979, 2020	239	2020
When to use parametric models in reinforcement learning? H van Hasselt, M Hessel, J Aslanides Neural Information Processing Systems 33, 2019	211	2019
Behaviour Suite for Reinforcement Learning I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ... International Conference on Learning Representations 8, 2020	180	2020
Teaching language models to support answers with verified quotes J Menick, M Trebacz, V Mikulik, J Aslanides, F Song, M Chadwick, ... arXiv preprint arXiv:2203.11147, 2022	165	2022
Fine-tuning language models to find agreement among humans with diverse preferences M Bakker, M Chadwick, H Sheahan, M Tessler, L Campbell-Gillingham, ... Advances in Neural Information Processing Systems 35, 38176-38189, 2022	144	2022
Relativity concept inventory: Development, analysis, and results JS Aslanides, CM Savage Physical Review Special Topics-Physics Education Research 9 (1), 010118, 2013	79	2013
A general approach to fairness with optimal transport S Chiappa, R Jiang, T Stepleton, A Pacchiano, H Jiang, J Aslanides AAAI, 2020	73*	2020
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning G Parascandolo, L Buesing, J Merel, L Hasenclever, J Aslanides, ... arXiv preprint arXiv:2004.11410, 2020	31	2020
Universal Reinforcement Learning Algorithms: Survey and Experiments J Aslanides, J Leike, M Hutter International Joint Conference on Artificial Intelligence 26, 1403-1410, 2017	25	2017
TF-Replicator: Distributed Machine Learning for Researchers P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ... arXiv preprint arXiv:1902.00465, 2019	24	2019
Fine-Tuning Language Models via Epistemic Neural Networks I Osband, SM Asghari, B Van Roy, N McAleese, J Aslanides, G Irving arXiv preprint arXiv:2211.01568, 2022	9	2022
AIXIjs: A software demo for general reinforcement learning J Aslanides arXiv preprint arXiv:1705.07615, 2017	6	2017
Generalised discount functions applied to a Monte-Carlo AImu implementation S Lamont, J Aslanides, J Leike, M Hutter Autonomous Agents and Multiagent Systems, 2017, 2017	5	2017

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors