1 code implementation • 7 Sep 2020 • Nicola Castaman, Enrico Pagello, Emanuele Menegatti, Alberto Pretto
Our approach iteratively solves a reduced planning problem over a receding window of a limited number of future actions during the implementation of the actions.
Robotics
no code implementations • 6 May 2020 • Andrea Franceschetti, Elisa Tosello, Nicola Castaman, Stefano Ghidoni
This paper proposes a detailed and extensive comparison of the Trust Region Policy Optimization and DeepQ-Network with Normalized Advantage Functions with respect to other state of the art algorithms, namely Deep Deterministic Policy Gradient and Vanilla Policy Gradient.