Search Results for author: Théo Vincent

Found 3 papers, 1 papers with code

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning

no code implementations25 May 2024 Théo Vincent, Fabian Wahren, Jan Peters, Boris Belousov, Carlo D'Eramo

Deep Reinforcement Learning (RL) is well known for being highly sensitive to hyperparameters, requiring practitioners substantial efforts to optimize them for the problem at hand.

Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning

no code implementations4 Mar 2024 Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo

It has been observed that this scheme can be potentially generalized to carry out multiple iterations of the Bellman operator at once, benefiting the underlying learning algorithm.

Atari Games Continuous Control +1

Parameterized Projected Bellman Operator

1 code implementation20 Dec 2023 Théo Vincent, Alberto Maria Metelli, Boris Belousov, Jan Peters, Marcello Restelli, Carlo D'Eramo

We formulate an optimization problem to learn PBO for generic sequential decision-making problems, and we theoretically analyze its properties in two representative classes of RL problems.

Decision Making Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.