1 code implementation • 24 Aug 2022 • Mahsa Asadi, Aurélien Bellet, Odalric-Ambrym Maillard, Marc Tommasi
We study the case where some of the distributions have the same mean, and the agents are allowed to actively query information from other agents.
no code implementations • 9 Oct 2019 • Mahsa Asadi, Mohammad Sadegh Talebi, Hippolyte Bourel, Odalric-Ambrym Maillard
In the case of an unknown equivalence structure, we show through numerical experiments that C-UCRL combined with ApproxEquivalence outperforms UCRL2 in ergodic MDPs.
Model-based Reinforcement Learning reinforcement-learning +1