no code implementations • 8 Sep 2023 • David Yunis, Justin Jung, Falcon Dai, Matthew Walter
Exploration in sparse-reward reinforcement learning is difficult due to the requirement of long, coordinated sequences of actions in order to achieve any reward.
no code implementations • NeurIPS 2019 • Falcon Dai, Matthew Walter
By analyzing the change in the maximum expected hitting cost, this work presents a formal understanding of the effect of potential-based reward shaping on regret (and sample complexity) in the undiscounted average reward setting.