1 code implementation • 10 Sep 2020 • Marcin Szulc, Jakub Łyskawa, Paweł Wawrzyński
Consequently, an agent learns from experiments that are distributed over time and potentially give better clues to policy improvement.
reinforcement-learning Reinforcement Learning (RL)