no code implementations • 18 Apr 2024 • Hector Kohler, Benoit Clement, Thomas Chaffre, Gilles Le Chenadec
We perform this stability analysis on a LB adaptive control system whose adaptive parameters are determined using a Cross-Entropy Deep Learning method.
no code implementations • 16 Apr 2024 • Hector Kohler, Quentin Delfosse, Paul Festor, Philippe Preux
Which reinforcement learning paradigms are best suited to developing interpretable agents?
no code implementations • 23 Sep 2023 • Hector Kohler, Riad Akrour, Philippe Preux
We show in this paper that deep RL can fail even on simple toy tasks of this class.
1 code implementation • 22 Sep 2023 • Hector Kohler, Riad Akrour, Philippe Preux
Finding an optimal decision tree for a supervised learning task is a challenging combinatorial problem to solve at scale.
no code implementations • 19 Jun 2023 • Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux
When comparing several RL algorithms, a major question is how many executions must be made and how we can ensure that the results of such a comparison are theoretically sound.
no code implementations • 11 Apr 2023 • Hector Kohler, Riad Akrour, Philippe Preux
A given supervised classification task is modeled as a Markov decision problem (MDP) and then augmented with additional actions that gather information about the features, equivalent to building a DT.