no code implementations • 17 Mar 2024 • Jingcheng Jiang, Haiyin Piao, Yu Fu, Yihang Hao, Chuanlu Jiang, Ziqi Wei, Xin Yang
Furthermore, we construct a dogfight scenario for aerial agents to demonstrate the practicality of the PDO algorithm.
no code implementations • 30 Dec 2023 • Yaqing Hou, Mingyang Sun, Abhishek Gupta, Yaochu Jin, Haiyin Piao, Hongwei Ge, Qiang Zhang
In this paper, we scale evolutionary algorithms to high-dimensional optimization problems that deceptively possess a low effective dimensionality (certain dimensions do not significantly affect the objective function).
no code implementations • 19 Dec 2023 • Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun
In reinforcement learning, the optimism in the face of uncertainty (OFU) is a mainstream principle for directing exploration towards less explored areas, characterized by higher uncertainty.
no code implementations • 10 Oct 2023 • Yang Wang, Bo Dong, Ke Xu, Haiyin Piao, Yufei Ding, BaoCai Yin, Xin Yang
Hence, given different inputs, it requires different time for converging to an adversarial sample.
no code implementations • 13 Jun 2023 • Siyuan Guo, Yanchao Sun, Jifeng Hu, Sili Huang, Hechang Chen, Haiyin Piao, Lichao Sun, Yi Chang
However, constrained by the limited quality of the offline dataset, its performance is often sub-optimal.
1 code implementation • 14 Oct 2022 • Jifeng Hu, Yanchao Sun, Hechang Chen, Sili Huang, Haiyin Piao, Yi Chang, Lichao Sun
Our main idea is to design the multi-action-branch reward estimation and policy-weighted reward aggregation for stabilized training.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 20 May 2022 • Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang, Randy Goebel, Bei Jiang, Yi Chang
The popular Proximal Policy Optimization (PPO) algorithm approximates the solution in a clipped policy space.
1 code implementation • NeurIPS 2021 • Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo
We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting.
1 code implementation • 21 Apr 2021 • Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, BaoCai Yin
Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and contribute remarkable progress.