Search Results for author: Steven Bohez

Found 16 papers, 3 papers with code

Barkour: Benchmarking Animal-level Agility with Quadruped Robots

no code implementations • 24 May 2023 • Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, Linda Luu, Ofir Nachum, Ken Oslund, Jason Powell, Diego Reyes, Francesco Romano, Feresteh Sadeghi, Ron Sloat, Baruch Tabanpour, Daniel Zheng, Michael Neunert, Raia Hadsell, Nicolas Heess, Francesco Nori, Jeff Seto, Carolina Parada, Vikas Sindhwani, Vincent Vanhoucke, Jie Tan

In the second approach, we distill the specialist skills into a Transformer-based generalist locomotion policy, named Locomotion-Transformer, that can handle various terrains and adjust the robot's gait based on the perceived environment and robot states.

Benchmarking Navigate

Paper
Add Code

NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields

no code implementations • 10 Oct 2022 • Arunkumar Byravan, Jan Humplik, Leonard Hasenclever, Arthur Brussee, Francesco Nori, Tuomas Haarnoja, Ben Moran, Steven Bohez, Fereshteh Sadeghi, Bojan Vujatovic, Nicolas Heess

A simulation is then created using the rendering engine in a physics simulator which computes contact dynamics from the static scene geometry (estimated from the NeRF volume density) and the dynamic objects' geometry and physical properties (assumed known).

Novel View Synthesis

Paper
Add Code

Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data

no code implementations • 12 Apr 2022 • Wenxuan Zhou, Steven Bohez, Jan Humplik, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja, Nicolas Heess

We propose the Offline Distillation Pipeline to break this trade-off by separating the training procedure into an online interaction phase and an offline distillation phase. Second, we find that training with the imbalanced off-policy data from multiple environments across the lifetime creates a significant performance drop.

Reinforcement Learning (RL)

Paper
Add Code

Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors

no code implementations • 31 Mar 2022 • Steven Bohez, Saran Tunyasuvunakool, Philemon Brakel, Fereshteh Sadeghi, Leonard Hasenclever, Yuval Tassa, Emilio Parisotto, Jan Humplik, Tuomas Haarnoja, Roland Hafner, Markus Wulfmeier, Michael Neunert, Ben Moran, Noah Siegel, Andrea Huber, Francesco Romano, Nathan Batchelor, Federico Casarini, Josh Merel, Raia Hadsell, Nicolas Heess

We investigate the use of prior knowledge of human and animal movement to learn reusable locomotion skills for real legged robots.

Paper
Add Code

Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner

no code implementations • 30 Oct 2021 • Philemon Brakel, Steven Bohez, Leonard Hasenclever, Nicolas Heess, Konstantinos Bousmalis

Imitation learning circumvents this problem and has been used with motion capture data to extract quadruped gaits for flat terrains.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

Explicit Pareto Front Optimization for Constrained Reinforcement Learning

no code implementations • 1 Jan 2021 • Sandy Huang, Abbas Abdolmaleki, Philemon Brakel, Steven Bohez, Nicolas Heess, Martin Riedmiller, Raia Hadsell

We propose a framework that uses a multi-objective RL algorithm to find a Pareto front of policies that trades off between the reward and constraint(s), and simultaneously searches along this front for constraint-satisfying policies.

Continuous Control reinforcement-learning +1

Paper
Add Code

dm_control: Software and Tasks for Continuous Control

2 code implementations • 22 Jun 2020 • Yuval Tassa, Saran Tunyasuvunakool, Alistair Muldal, Yotam Doron, Piotr Trochim, Si-Qi Liu, Steven Bohez, Josh Merel, Tom Erez, Timothy Lillicrap, Nicolas Heess

The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation.

Continuous Control reinforcement-learning +1

3,576

Paper
Code

Relative Entropy Regularized Policy Iteration

1 code implementation • 5 Dec 2018 • Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller

Our algorithm draws on connections to existing literature on black-box optimization and 'RL as an inference' and it can be seen either as an extension of the Maximum a Posteriori Policy Optimisation algorithm (MPO) [Abdolmaleki et al., 2018a], or as an extension of Trust Region Covariance Matrix Adaptation Evolutionary Strategy (CMA-ES) [Abdolmaleki et al., 2017b; Hansen et al., 1997] to a policy iteration scheme.

Continuous Control OpenAI Gym +1

Paper
Code

Success at any cost: value constrained model-free continuous control

no code implementations • 27 Sep 2018 • Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell

We demonstrate the efficiency of our approach using a number of continuous control benchmark tasks as well as a realistic, energy-optimized quadruped locomotion task.

Continuous Control

Paper
Add Code

Sim-to-Real: Learning Agile Locomotion For Quadruped Robots

no code implementations • 27 Apr 2018 • Jie Tan, Tingnan Zhang, Erwin Coumans, Atil Iscen, Yunfei Bai, Danijar Hafner, Steven Bohez, Vincent Vanhoucke

The control policies are learned in a physics simulator and then deployed on real robots.

Paper
Add Code

Transfer Learning with Binary Neural Networks

no code implementations • 29 Nov 2017 • Sam Leroux, Steven Bohez, Tim Verbelen, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

Binary neural networks are attractive in this case because the logical operations are very fast and efficient when implemented in hardware.

Transfer Learning

Paper
Add Code

Decoupled Learning of Environment Characteristics for Safe Exploration

no code implementations • 9 Aug 2017 • Pieter Van Molle, Tim Verbelen, Steven Bohez, Sam Leroux, Pieter Simoens, Bart Dhoedt

However, when learning a task using reinforcement learning, the agent cannot distinguish the characteristics of the environment from those of the task.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Sensor Fusion for Robot Control through Deep Reinforcement Learning

no code implementations • 13 Mar 2017 • Steven Bohez, Tim Verbelen, Elias De Coninck, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

Deep reinforcement learning is becoming increasingly popular for robot control algorithms, with the aim for a robot to self-learn useful feature representations from unstructured sensory input leading to the optimal actuation policy.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Lazy Evaluation of Convolutional Filters

no code implementations • 27 May 2016 • Sam Leroux, Steven Bohez, Cedric De Boom, Elias De Coninck, Tim Verbelen, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

In this paper we propose a technique which avoids the evaluation of certain convolutional filters in a deep neural network.

Paper
Add Code

Efficiency Evaluation of Character-level RNN Training Schedules

1 code implementation • 9 May 2016 • Cedric De Boom, Sam Leroux, Steven Bohez, Pieter Simoens, Thomas Demeester, Bart Dhoedt

We present four training and prediction schedules from the same character-level recurrent neural network.

Paper
Code

Learning Semantic Similarity for Very Short Texts

no code implementations • 2 Dec 2015 • Cedric De Boom, Steven Van Canneyt, Steven Bohez, Thomas Demeester, Bart Dhoedt

We therefore investigated several text representations as a combination of word embeddings in the context of semantic pair matching.

Information Retrieval Retrieval +5

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.