1 code implementation • 5 Mar 2024 • Ke Zhang, Dandan Zhu, Qiuhan Xu, Hao Zhou, Ce Zheng
Agents share Q-value network periodically during the training process.
Federated Learning reinforcement-learning +2