1 code implementation • 16 Dec 2023 • Ruining Zhang, Haoran Han, Maolong Lv, Qisong Yang, Jian Cheng
Extensive utilization of deep reinforcement learning (DRL) policy networks in diverse continuous control tasks has raised questions regarding performance degradation in expansive state spaces where the input state norm is larger than that in the training environment.