no code implementations • 21 Nov 2020 • Elahe Aghapour, Nora Ayanian
In this paper, we propose a meta-reinforcement learning approach to learn the dynamic model of a non-stationary environment to be used for meta-policy optimization later.
Meta-Learning Meta Reinforcement Learning +3