no code implementations • 12 Apr 2023 • Amir M. Soufi Enayati, Zengjie Zhang, Kashish Gupta, Homayoun Najjaran
A comparison study between the proposed method and a traditional off-policy reinforcement learning algorithm indicates its advantage in learning performance and potential value for applications.