no code implementations • 8 May 2023 • Elena Shrestha, Chetan Reddy, Hanxi Wan, Yulun Zhuang, Ram Vasudevan
As a result, MBRL agents may converge to sub-optimal policies if the world model is inaccurate.
Continuous Control Model-based Reinforcement Learning +1