Daniël Willemsen
Daniël Willemsen
Verified email at student.tudelft.nl - Homepage
Title
Cited by
Cited by
Year
Value targets in off-policy AlphaZero: a new greedy backup
D Willemsen, H Baier, M Kaisers
Adaptive and Learning Agents (ALA) Workshop, 2020
32020
MAMBPO: Sample-efficient multi-robot reinforcement learning using learned world models
D Willemsen, M Coppola, GCHE de Croon
arXiv preprint arXiv:2103.03662, 2021
2021
Sample-efficient multi-agent reinforcement learning using learned world models
D Willemsen
Delft University of Technology, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–3