Search Results for author: Haiyin Piao

Found 9 papers, 4 papers with code

Phasic Diversity Optimization for Population-Based Reinforcement Learning

no code implementations • 17 Mar 2024 • Jingcheng Jiang, Haiyin Piao, Yu Fu, Yihang Hao, Chuanlu Jiang, Ziqi Wei, Xin Yang

Furthermore, we construct a dogfight scenario for aerial agents to demonstrate the practicality of the PDO algorithm.

Multi-Armed Bandits reinforcement-learning

Multiform Evolution for High-Dimensional Problems with Low Effective Dimensionality

no code implementations • 30 Dec 2023 • Yaqing Hou, Mingyang Sun, Abhishek Gupta, Yaochu Jin, Haiyin Piao, Hongwei Ge, Qiang Zhang

In this paper, we scale evolutionary algorithms to high-dimensional optimization problems that deceptively possess a low effective dimensionality (certain dimensions do not significantly affect the objective function).

Evolutionary Algorithms

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

no code implementations • 19 Dec 2023 • Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun

In reinforcement learning, the optimism in the face of uncertainty (OFU) is a mainstream principle for directing exploration towards less explored areas, characterized by higher uncertainty.

Continuous Control

A Geometrical Approach to Evaluate the Adversarial Robustness of Deep Neural Networks

no code implementations • 10 Oct 2023 • Yang Wang, Bo Dong, Ke Xu, Haiyin Piao, Yufei Ding, BaoCai Yin, Xin Yang

Hence, different inputs require different amounts of time to converge to an adversarial sample.

Adversarial Robustness

A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning

no code implementations • 13 Jun 2023 • Siyuan Guo, Yanchao Sun, Jifeng Hu, Sili Huang, Hechang Chen, Haiyin Piao, Lichao Sun, Yi Chang

However, constrained by the limited quality of the offline dataset, its performance is often sub-optimal.

D4RL Efficient Exploration +3

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

1 code implementation • 14 Oct 2022 • Jifeng Hu, Yanchao Sun, Hechang Chen, Sili Huang, Haiyin Piao, Yi Chang, Lichao Sun

Our main idea is to design the multi-action-branch reward estimation and policy-weighted reward aggregation for stabilized training.

Multi-agent Reinforcement Learning reinforcement-learning +1

The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure

1 code implementation • 20 May 2022 • Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang, Randy Goebel, Bei Jiang, Yi Chang

The popular Proximal Policy Optimization (PPO) algorithm approximates the solution in a clipped policy space.

Efficient Exploration Policy Gradient Methods

Coordinated Proximal Policy Optimization

1 code implementation • NeurIPS 2021 • Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo

We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting.

Starcraft Starcraft II

A Two-Stage Attentive Network for Single Image Super-Resolution

1 code implementation • 21 Apr 2021 • Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, BaoCai Yin

Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and have achieved remarkable progress.

Image Reconstruction Image Super-Resolution +1
