no code implementations • 23 May 2024 • Charles A. Hepburn, Yue Jin, Giovanni Montana
Additionally, we introduce StaCQ, a deep learning algorithm that is both performance-driven on the D4RL benchmark datasets and closely aligned with our theoretical propositions.
no code implementations • 8 Dec 2022 • Charles A. Hepburn, Giovanni Montana
Furthermore, using the D4RL benchmarking suite, we demonstrate that state-of-the-art results are obtained by combining TS with two existing offline learning methodologies reliant on BC, model-based offline planning (MBOP) and policy constraint (TD3+BC).
no code implementations • 21 Nov 2022 • Charles A. Hepburn, Giovanni Montana
We propose a model-based data augmentation strategy, Trajectory Stitching (TS), to improve the quality of sub-optimal historical trajectories.