Ashley Edwards
Ashley Edwards
Senior Research Scientist, Google DeepMind
Verified email at - Homepage
Cited by
Cited by
A generalist agent
S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ...
arXiv preprint arXiv:2205.06175, 2022
Imitating latent policies from observation
AD Edwards, H Sahni, Y Schroecker, CL Isbell
International Conference on Machine Learning (ICML 2019), 2019
Forward-backward reinforcement learning
AD Edwards, L Downs, JC Davidson
ICRA Machine Learning in Planning and Control of Robot Motion Workshop, 2018
Genie: Generative interactive environments
J Bruce, MD Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ...
Forty-first International Conference on Machine Learning, 2024
Perceptual reward functions
A Edwards, C Isbell, A Takanishi
IJCAI Deep Reinforcement Learning: Frontiers and Challenges Workshop, 2016
Estimating Q (s, s') with Deep Deterministic Dynamics Gradients
AD Edwards, H Sahni, R Liu, J Hung, A Jain, R Wang, A Ecoffet, T Miconi, ...
International Conference on Machine Learning (ICML 2020), 2020
Perceptual Values from Observation
AD Edwards, CL Isbell
ICML Self-Supervised Learning Workshop, 2019
Cross-domain perceptual reward functions
AD Edwards, S Sood, CL Isbell Jr
The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2017
Learning few-shot imitation as cultural transmission
A Bhoopchand, B Brownfield, A Collister, A Dal Lago, A Edwards, ...
Nature Communications 14 (1), 7536, 2023
Higher order Q-learning
A Edwards, WM Pottenger
2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2011
Perceptual Goal Specifications for Reinforcement Learning
AD Edwards
PhD thesis proposal, Georgia Institute of Technology, 2017
Learning Robust Real-Time Cultural Transmission without Human Data
CGI Team, A Bhoopchand, B Brownfield, A Collister, AD Lago, A Edwards, ...
arXiv preprint arXiv:2203.00715, 2022
Transferring Agent Behaviors from Videos via Motion GANs
AD Edwards, CL Isbell Jr
NIPS Deep Reinforcement Learning Symposium, 2017
Expressing Tasks Robustly via Multiple Discount Factors
A Edwards, ML Littman, CL Isbell
The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2015
Autoregressively generating sequences of data elements defining actions to be performed by an agent
T Erez, A Novikov, E Parisotto, JW Rae, K Zolna, MMR Denil, ...
US Patent App. 17/410,689, 2023
Emulation and Imitation via Perceptual Goal Specifications
AD Edwards
PhD thesis, Georgia Institute of Technology, 2019
The system can't perform the operation now. Try again later.
Articles 1–16