no code implementations • 1 Nov 2023 • Rizhong Wang, Huiping Li, Di Cui, Demin Xu
Once a joint policy is obtained, it is critical to design a value function factorization method to extract optimal decentralized policies for the agents, which needs to satisfy the individual-global-max (IGM) principle.