no code implementations • 15 Sep 2021 • Ruizhen Liu, Dazhi Zhong, Zhicong Chen
We consider the problem of offline reinforcement learning with model-based control, whose goal is to learn a dynamics model from the experience replay and obtain a pessimism-oriented agent under the learned model.