no code implementations • 29 Sep 2021 • Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao
QMIX, a popular MARL algorithm based on the monotonicity constraint, has been used as a baseline for the benchmark environments, such as Starcraft Multi-Agent Challenge (SMAC), Predator-Prey (PP).
2 code implementations • 6 Feb 2021 • Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao
Multi-Agent Reinforcement Learning (MARL) has seen revolutionary breakthroughs with its successful application to multi-agent cooperative tasks such as computer games and robot swarms.
no code implementations • 9 Sep 2020 • Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao
Existing methods such as Value Decomposition Network (VDN) and QMIX estimate the value of long-term returns as a scalar that does not contain the information of randomness.