no code implementations • 10 Sep 2022 • Yunxiao Guo, Xinjia Xie, Runhao Zhao, Chenglan Zhu, Jiangting Yin, Han Long
As for cooperation, we design the agents' reward for flocking tasks according to the boids model.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 20 Oct 2021 • Yunxiao Guo, Han Long, Xiaojun Duan, Kaiyuan Feng, Maochu Li, Xiaying Ma
As an algorithm based on deep reinforcement learning, Proximal Policy Optimization (PPO) performs well in many complex tasks and has become one of the most popular RL algorithms in recent years.