Search Results for author: Junmin Zhong

Found 3 papers, 0 papers with code

Actor-Critic Reinforcement Learning with Phased Actor

no code implementations • 18 Apr 2024 • Ruofan Wu, Junmin Zhong, Jennie Si

We prove qualitative properties of PAAC for learning convergence of the value and policy, solution optimality, and stability of system dynamics.

Policy Gradient Methods, reinforcement-learning, +1 more

Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning

no code implementations • 7 Nov 2023 • Junmin Zhong, Ruofan Wu, Jennie Si

We address the issue of estimation bias in deep reinforcement learning (DRL) by introducing solution mechanisms that include a new, twin TD-regularized actor-critic (TDR) method.

Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems

no code implementations • 10 Oct 2022 • Junmin Zhong, Ruofan Wu, Jennie Si

However, there is a lack of comprehensive and systematic study on this important aspect to demonstrate the effectiveness of multi-step methods in solving highly complex continuous control problems.

Continuous Control, OpenAI Gym, +2 more
