Search Results for author: Zhengyao Jiang

Found 9 papers, 8 papers with code

H-GAP: Humanoid Control with a Generalist Planner

no code implementations • 5 Dec 2023 • Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian

However, the extensive collection of human motion-captured data and the derived datasets of humanoid trajectories, such as MoCapAct, paves the way to tackle these challenges.

Humanoid Control Model Predictive Control +1

Paper
Add Code

Mildly Constrained Evaluation Policy for Offline Reinforcement Learning

1 code implementation • 6 Jun 2023 • Linjie Xu, Zhengyao Jiang, Jinyu Wang, Lei Song, Jiang Bian

Offline reinforcement learning (RL) methodologies enforce constraints on the policy to adhere closely to the behavior policy, thereby stabilizing value learning and mitigating the selection of out-of-distribution (OOD) actions during test time.

Offline RL reinforcement-learning +1

Paper
Code

Optimal Transport for Offline Imitation Learning

1 code implementation • 24 Mar 2023 • Yicheng Luo, Zhengyao Jiang, samuel cohen, Edward Grefenstette, Marc Peter Deisenroth

In this paper, we introduce Optimal Transport Reward labeling (OTR), an algorithm that assigns rewards to offline trajectories, with a few high-quality demonstrations.

D4RL Imitation Learning +2

Paper
Code

Efficient Planning in a Compact Latent Action Space

1 code implementation • 22 Aug 2022 • Zhengyao Jiang, Tianjun Zhang, Michael Janner, Yueying Li, Tim Rocktäschel, Edward Grefenstette, Yuandong Tian

Planning-based reinforcement learning has shown strong performance in tasks in discrete and low-dimensional continuous action spaces.

Continuous Control Decision Making +2

Paper
Code

Graph Backup: Data Efficient Backup Exploiting Markovian Transitions

1 code implementation • 31 May 2022 • Zhengyao Jiang, Tianjun Zhang, Robert Kirk, Tim Rocktäschel, Edward Grefenstette

In this paper, we treat the transition data of the MDP as a graph, and define a novel backup operator, Graph Backup, which exploits this graph structure for better value estimation.

Atari Games counterfactual +2

Paper
Code

Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning

1 code implementation • 8 Feb 2021 • Zhengyao Jiang, Pasquale Minervini, Minqi Jiang, Tim Rocktaschel

In this work, we show that we can incorporate relational inductive biases, encoded in the form of relational graphs, into agents.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Neural Logic Reinforcement Learning

1 code implementation • 24 Apr 2019 • Zhengyao Jiang, Shan Luo

Deep reinforcement learning (DRL) has achieved significant breakthroughs in various tasks.

Inductive logic programming Policy Gradient Methods +2

Paper
Code

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

27 code implementations • 30 Jun 2017 • Zhengyao Jiang, Dixing Xu, Jinjun Liang

They are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market.

Management Portfolio Optimization +2

1,703

Paper
Code

Cryptocurrency Portfolio Management with Deep Reinforcement Learning

3 code implementations • 5 Dec 2016 • Zhengyao Jiang, Jinjun Liang

Portfolio management is the decision-making process of allocating an amount of fund into different financial investment products.

Decision Making Management +2

1,703

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.