no code implementations • 1 Nov 2023 • Rizhong Wang, Huiping Li, Di Cui, Demin Xu
Once a joint policy is obtained, it is critical to design a value function factorization method to extract optimal decentralized policies for the agents, which needs to satisfy the individual-global-max (IGM) principle.
no code implementations • 13 Sep 2023 • Zhuoying Chen, Huiping Li, Rizhong Wang
Prioritized Experience Replay (PER) is a technical means of deep reinforcement learning by selecting experience samples with more knowledge quantity to improve the training rate of neural network.