no code implementations • 3 May 2023 • Kairui Guo, Adrian Cheng, Yaqi Li, Jun Li, Rob Duffield, Steven W. Su
Based on the proposed co-adaptive MDPs, the simulation study indicates that the non-stationarity problem can be mitigated using the proposed policy improvement approaches.
Model-based Reinforcement Learning • Multi-agent Reinforcement Learning