1 code implementation • 10 Oct 2022 • Wubing Chen, Wenbin Li, Xiao Liu, Shangdong Yang, Yang Gao
Empirically, we evaluate MAPPG on a well-known matrix game and a differential game, and verify that MAPPG can converge to the global optimum in both discrete and continuous action spaces.
Multi-agent Reinforcement Learning, Reinforcement Learning
no code implementations • 2 Mar 2022 • Xiao Liu, Shuyang Liu, Wenbin Li, Shangdong Yang, Yang Gao
Although deep reinforcement learning has become a universal solution for complex control tasks, its real-world applicability is still limited because its policies lack security guarantees.
no code implementations • 22 Jan 2022 • Guang Yang, Xingguo Chen, Shangdong Yang, Huihui Wang, Shaokang Dong, Yang Gao
Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA.