1 code implementation • 15 Feb 2024 • Quentin Gallouédec, Edward Beeching, Clément Romac, Emmanuel Dellandréa
The search for a general model that can operate seamlessly across multiple domains remains a key goal in machine learning research.
1 code implementation • 25 Oct 2023 • Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf
Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization (dDPO) to learn a chat model with significantly improved intent alignment.
Ranked #7 on Zero-Shot Learning on MedConceptsQA
no code implementations • 22 Dec 2021 • Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Dibangoye, Olivier Simonin, Joshua Romoff, Christian Wolf
We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.
1 code implementation • 7 Dec 2021 • Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf
We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.
1 code implementation • ECCV 2020 • Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf
We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph-based planner and a local policy.
no code implementations • 24 Jan 2020 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin
The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.
1 code implementation • 3 Apr 2019 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin
In this paper, we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high-fidelity photographic observations.