Harm van Seijen

Παρατίθεται από

	Όλα	Από το 2019
Παραθέσεις	1800	1482
h-index	17	16
i10-index	23	21

360

180

270

2011201220132014201520162017201820192020202120222023202411 7 15 17 24 29 63 143 166 202 249 323 346 195

Δημόσια πρόσβαση

Προβολή όλων

6 άρθρα

0 άρθρα

διαθέσιμα

μη διαθέσιμα

Σύμφωνα με εντολές χρηματοδότησης

Συν-συγγραφείς

Richard S. SuttonKeen, Amii, and University of AlbertaΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα richsutton.com
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα cs.ox.ac.uk
Marco WieringInstitute of Artificial Intelligence and Cognitive Engineering, University of GroningenΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα rug.nl
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Patrick M. PilarskiProfessor, University of Alberta, Amii (Alberta Machine Intelligence Institute)Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα ualberta.ca
A. Rupam MahmoodUniversity of Alberta, AmiiΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα ualberta.ca
Marlos C. MachadoUniversity of AlbertaΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα ualberta.ca

Παρακολούθηση

Harm van Seijen

Sony AI

Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα sony.com

reinforcement learning machine learning representation learning


Τίτλος Ταξινόμηση με βάση τις αναφορές Ταξινόμηση κατά έτος Ταξινόμηση κατά τίτλο	Παρατίθεται από Παρατίθεται από	Έτος
Reducing network agnostophobia AR Dhamija, M Günther, T Boult Advances in Neural Information Processing Systems 31, 2018	364	2018
Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017	290	2017
A theoretical and empirical analysis of expected sarsa H Van Seijen, H Van Hasselt, S Whiteson, M Wiering 2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009	281	2009
True online TD (lambda) H Seijen, R Sutton International Conference on Machine Learning, 692-700, 2014	130	2014
True online temporal-difference learning H Van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton Journal of Machine Learning Research 17 (145), 1-40, 2016	115	2016
Systematic generalisation with group invariant predictions F Ahmed, Y Bengio, H Van Seijen, A Courville International Conference on Learning Representations, 2020	101	2020
A Deeper Look at Planning as Learning from Replay H van Seijen, RS Sutton International Conference on Machine Learning, 2015	78	2015
Planning by prioritized sweeping with small backups H Van Seijen, R Sutton International Conference on Machine Learning, 361-369, 2013	61*	2013
Modular lifelong reinforcement learning via neural composition JA Mendez, H van Seijen, E Eaton arXiv preprint arXiv:2207.00429, 2022	43	2022
Hybrid reward architecture for reinforcement learning HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff US Patent 10,977,551, 2021	43	2021
Using a logarithmic mapping to enable lower discount factors in reinforcement learning H Van Seijen, M Fatemi, A Tavakoli Advances in Neural Information Processing Systems 32, 2019	31	2019
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017	28	2017
On value function representation of long horizon problems L Lehnert, R Laroche, H van Seijen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	27	2018
Exploiting Best-Match Equations for Efficient Reinforcement Learning. H van Seijen, S Whiteson, H van Hasselt, M Wiering Journal of Machine Learning Research 12 (6), 2011	27	2011
Effective multi-step temporal-difference learning for non-linear function approximation H van Seijen arXiv preprint arXiv:1608.05151, 2016	23	2016
Dead-ends and secure exploration in reinforcement learning M Fatemi, S Sharma, H Van Seijen, SE Kahou International Conference on Machine Learning, 1873-1881, 2019	21	2019
Efficient abstraction selection in reinforcement learning H van Seijen, S Whiteson, L Kester Computational Intelligence 30 (4), 657-699, 2014	19	2014
Learning invariances for policy generalization R Tachet, P Bachman, H van Seijen arXiv preprint arXiv:1809.02591, 2018	16	2018
Separation of concerns in reinforcement learning H van Seijen, M Fatemi, J Romoff, R Laroche arXiv preprint arXiv:1612.05159, 2016	15	2016
Agent-controller representations: Principled offline rl with rich exogenous information R Islam, M Tomar, A Lamb, Y Efroni, H Zang, A Didolkar, D Misra, X Li, ... arXiv preprint arXiv:2211.00164, 2022	13	2022

Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.

Άρθρα 1–20

Παραθέσεις ανά έτος

Διπλότυπες αναφορές

Συγχωνευμένες αναφορές

Προσθήκη από κοινού συγγραφέωνΣυν-συγγραφείς

Παρακολούθηση

Παρατίθεται από

Συν-συγγραφείς