Search Results for author: Davide Mambelli

Found 2 papers, 0 papers with code

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

no code implementations • 19 Feb 2024 • Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, Frans A. Oliehoek

A well-established off-policy objective is the excursion objective.

Policy Gradient Methods


Compositional Multi-Object Reinforcement Learning with Linear Relation Networks

no code implementations • 31 Jan 2022 • Davide Mambelli, Frederik Träuble, Stefan Bauer, Bernhard Schölkopf, Francesco Locatello

Although reinforcement learning has seen remarkable progress over the last years, solving robust dexterous object-manipulation tasks in multi-object settings remains a challenge.

Object, reinforcement-learning (+2 more)

