Daniël Willemsen
Verified email at student.tudelft.nl
Title
Cited by
Year
MAMBPO: Sample-efficient multi-robot reinforcement learning using learned world models
D Willemsen, M Coppola, GCHE de Croon
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2021
Cited by: 29, Year: 2021
Value targets in off-policy AlphaZero: a new greedy backup
D Willemsen, H Baier, M Kaisers
Neural Computing and Applications 34 (3), 1801-1814, 2022
Cited by: 8, Year: 2022
Sample-efficient multi-agent reinforcement learning using learned world models
D Willemsen
Delft University of Technology, 2021
Year: 2021