no code implementations • 11 Nov 2021 • John C. Raisbeck, Matthew W. Allen, Hakho Lee
The resulting definition of exploration can be applied to infinite problems and to non-dynamic learning methods, which the dynamic notion of exploration cannot tolerate.
no code implementations • 27 Dec 2019 • John C. Raisbeck, Matthew Allen, Ralph Weissleder, Hyungsoon Im, Hakho Lee
Since the debut of Evolution Strategies (ES) as a tool for Reinforcement Learning in Salimans et al. (2017), there has been interest in determining the exact relationship between the Evolution Strategies gradient and the gradient of a similar class of algorithms, Finite Differences (FD).
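
To make the two quantities in this excerpt concrete, here is a minimal Python sketch that computes an ES-style gradient estimate and a central finite-difference gradient for the same point. It is not taken from the paper: the objective `f`, the function names, and the values of `sigma`, `n_pairs`, and `h` are all assumptions chosen for illustration, and the ES estimator shown is one common antithetic (mirrored-sampling) form rather than the specific formulation the authors analyze.

```python
import numpy as np


def f(x):
    # Hypothetical smooth objective, chosen only for illustration.
    # Its exact gradient is -2 * (x - 1).
    return -np.sum((x - 1.0) ** 2)


def es_gradient(f, x, sigma=0.1, n_pairs=5000, seed=0):
    """Antithetic ES estimate of the gradient of f at x.

    Averages (f(x + sigma*e) - f(x - sigma*e)) / (2*sigma) * e
    over Gaussian perturbations e ~ N(0, I).
    """
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal((n_pairs, x.size))
    diffs = np.array([f(x + sigma * e) - f(x - sigma * e) for e in eps])
    return (diffs[:, None] * eps).mean(axis=0) / (2.0 * sigma)


def fd_gradient(f, x, h=1e-4):
    """Central finite-difference gradient along each coordinate axis."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = h
        grad[i] = (f(x + step) - f(x - step)) / (2.0 * h)
    return grad


if __name__ == "__main__":
    x0 = np.array([0.0, 2.0, -1.0])
    print("ES estimate:", es_gradient(f, x0))
    print("FD estimate:", fd_gradient(f, x0))
    print("Exact      :", -2.0 * (x0 - 1.0))
```

In this sketch FD perturbs one coordinate at a time, while ES perturbs all coordinates at once along random Gaussian directions and averages. On a quadratic objective like this one, both converge to the same gradient, with the ES estimate carrying sampling noise that shrinks as `n_pairs` grows.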