no code implementations • 3 May 2024 • Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal
The current state-of-the-art theoretical analysis of Actor-Critic (AC) algorithms significantly lags in addressing the practical aspects of AC implementations.
no code implementations • 18 Jun 2023 • Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal
To achieve that, we propose a Natural Actor-Critic algorithm with 2-Layer critic parametrization (NAC2L).
no code implementations • 14 Nov 2022 • Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal
Deep Q-learning based algorithms have been applied successfully in many decision making problems, while their theoretical foundations are not as well understood.