no code implementations • 7 Aug 2023 • Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu, Kunpeng Wang, Wenlai Zhao, Guangwen Yang
Existing large language models have to run K times to generate a sequence of K tokens.
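The cost this entry refers to can be made concrete with a toy sketch: standard autoregressive decoding calls the model once per generated token, so K new tokens mean K sequential forward passes. The `model` below is a hypothetical stand-in (it just predicts last token + 1), not the paper's architecture.

```python
def model(tokens):
    # Toy next-token predictor: returns last token + 1.
    return tokens[-1] + 1

def generate(prompt, k):
    tokens = list(prompt)
    for _ in range(k):          # one full forward pass per generated token
        next_token = model(tokens)
        tokens.append(next_token)
    return tokens

print(generate([0], 5))  # -> [0, 1, 2, 3, 4, 5]
```

The loop body is the expensive part in a real LLM; reducing the number of iterations needed for K tokens is exactly the bottleneck this line of work targets.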
1 code implementation • 24 May 2023 • Yushu Chen, Shengzhuo Liu, Jinzhe Yang, Hao Jing, Wenlai Zhao, Guangwen Yang
In order to enhance the performance of Transformer models for long-term multivariate forecasting while minimizing computational demands, this paper introduces the Joint Time-Frequency Domain Transformer (JTFT).
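As a rough illustration of the frequency-domain side of such a joint representation (a hedged sketch, not the JTFT architecture): long series are often well summarized by a small number of dominant frequency components, which is what makes a compact frequency-domain encoding attractive for long-horizon forecasting. The top-k coefficient selection below is an assumption chosen for illustration.

```python
import numpy as np

def low_rank_frequency(x, k):
    # Keep only the k largest-magnitude FFT coefficients of x,
    # zero out the rest, and reconstruct the series from them.
    coeffs = np.fft.rfft(x)
    keep = np.argsort(np.abs(coeffs))[-k:]   # indices of top-k coefficients
    mask = np.zeros_like(coeffs)
    mask[keep] = coeffs[keep]
    return np.fft.irfft(mask, n=len(x))

t = np.linspace(0, 1, 128, endpoint=False)
x = np.sin(2 * np.pi * 3 * t) + 0.1 * np.random.default_rng(0).normal(size=128)
x_hat = low_rank_frequency(x, k=2)
print(np.abs(x - x_hat).mean())  # small: 2 coefficients capture the signal
```

A length-128 series is summarized here by 2 complex coefficients, which hints at why frequency-domain modeling can cut computational demands for long inputs.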
no code implementations • 9 Sep 2022 • Yushu Chen, Guangwen Yang, Lu Wang, Qingzhong Gan, Haipeng Chen, Quanyong Xu
Atmospheric powered descent guidance can be solved by successive convexification; however, its onboard application is impeded by the sharp increase in computation caused by nonlinear aerodynamic forces.
no code implementations • 15 Aug 2022 • Xin Wang, Wei Xue, Yilun Han, Guangwen Yang
We develop NeuroGCM, a user-friendly platform for efficiently developing hybrid models for climate simulation.
no code implementations • 24 Apr 2022 • Zheng Zhang, Yingsheng Ji, Jiachen Shen, Xi Zhang, Guangwen Yang
Risk assessment is a substantial problem for financial institutions that has been extensively studied both for its methodological richness and its various practical applications.
no code implementations • NeurIPS 2020 • Zichuan Lin, Derek Yang, Li Zhao, Tao Qin, Guangwen Yang, Tie-Yan Liu
In this work, we propose a set of novel reward decomposition principles that constrain the uniqueness and compactness of the state features/representations relevant to each sub-reward.
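The general idea behind reward decomposition (sketched here in a deliberately minimal form, not the paper's method) is to treat the scalar reward as a sum of sub-rewards, keep a separate value estimate per channel, and act on the aggregate. The two channels and the tiny tabular update below are illustrative assumptions.

```python
NUM_CHANNELS = 2  # e.g. hypothetical "progress" and "safety" sub-rewards

# Per-channel value estimates over 2 actions.
q_values = [[0.0, 0.0] for _ in range(NUM_CHANNELS)]

def update(action, reward_vector, lr=0.5):
    # Each channel is updated only from its own sub-reward.
    for c in range(NUM_CHANNELS):
        q_values[c][action] += lr * (reward_vector[c] - q_values[c][action])

def best_action():
    # The policy acts on the sum of the per-channel estimates.
    totals = [sum(q_values[c][a] for c in range(NUM_CHANNELS)) for a in range(2)]
    return max(range(2), key=totals.__getitem__)

update(0, [1.0, 0.0])
update(1, [0.0, 2.0])
print(best_action())  # -> 1: its summed channel values are largest
```

The paper's contribution concerns *how* to choose the decomposition (uniqueness and compactness of the features each channel depends on); the sketch only shows the channel structure itself.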
1 code implementation • NeurIPS 2020 • Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma
When the test task distribution is different from the training task distribution, the performance may degrade significantly.
no code implementations • ICLR 2020 • Guangxiang Zhu*, Zichuan Lin*, Guangwen Yang, Chongjie Zhang
Sample efficiency has been one of the major challenges for deep reinforcement learning.
1 code implementation • 6 Feb 2020 • Mingzhen Li, Yi Liu, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian
In this paper, we present a comprehensive survey of existing DL compilers by dissecting their commonly adopted designs in detail, with an emphasis on the DL-oriented multi-level IRs and the frontend/backend optimizations.
1 code implementation • Geoscientific Model Development 2019 • Xiaomeng Huang, Xing Huang, Dong Wang, Qi Wu, Yi Li, Shixun Zhang, YuWen Chen, Mingqing Wang, Yuan Gao, Qiang Tang, Yue Chen, Zheng Fang, Zhenya Song, Guangwen Yang
In this work, we design a simple computing library to bridge the gap and decouple the work of ocean modeling from parallel computing.
no code implementations • NeurIPS 2019 • Zichuan Lin, Li Zhao, Derek Yang, Tao Qin, Guangwen Yang, Tie-Yan Liu
Many reinforcement learning (RL) tasks have specific properties that can be leveraged to modify existing RL algorithms to adapt to those tasks and further improve performance, and a general class of such properties is the multiple reward channel.
1 code implementation • 4 May 2019 • Yushu Chen, Hao Jing, Wenlai Zhao, Zhi-Qiang Liu, Ouyi Li, Liang Qiao, Wei Xue, Guangwen Yang
RSG is further combined with adaptive methods to construct ARSG for acceleration.
no code implementations • 16 Apr 2019 • Mingzhen Li, Changxi Liu, Jianjin Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian
The proliferation of deep learning frameworks and hardware platforms demands an efficient compiler that can hide the diversity of both software and hardware in order to provide application portability.
no code implementations • 16 Mar 2019 • Jiarui Fang, Liandeng Li, Haohuan Fu, Jinlei Jiang, Wenlai Zhao, Conghui He, Xin You, Guangwen Yang
Second, we propose a set of optimization strategies for redesigning a variety of neural network layers based on Caffe.
no code implementations • 15 Jan 2019 • Ming-Cheng Chen, Riling Li, Lin Gan, Xiaobo Zhu, Guangwen Yang, Chao-Yang Lu, Jian-Wei Pan
We show that low-depth random quantum circuits can be efficiently simulated by a quantum teleportation-inspired algorithm.
Quantum Physics
no code implementations • ICLR 2019 • Jiarui Fang, Haohuan Fu, Guangwen Yang, Cho-Jui Hsieh
Data parallelism has become a dominant method to scale Deep Neural Network (DNN) training across multiple nodes.
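Synchronous data parallelism, the setting this entry refers to, can be sketched in a few lines: each worker computes a gradient on its own shard of the batch, the gradients are averaged (the all-reduce step), and every replica applies the identical update. The scalar least-squares model below is an illustrative assumption, not the paper's workload.

```python
def grad(w, x, y):
    # Gradient of 0.5 * (w*x - y)**2 with respect to w.
    return (w * x - y) * x

def data_parallel_step(w, shards, lr=0.1):
    # Each "worker" averages gradients over its own data shard.
    grads = [sum(grad(w, x, y) for x, y in shard) / len(shard)
             for shard in shards]
    g = sum(grads) / len(grads)   # all-reduce: average across workers
    return w - lr * g             # same update applied on every replica

shards = [[(1.0, 2.0)], [(2.0, 4.0)]]   # two workers, one sample each
w = data_parallel_step(0.0, shards)
print(w)  # -> 0.5
```

In a real multi-node setup the averaging is a communication collective rather than a list comprehension, and its cost relative to compute is what scaling work like this paper addresses.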
no code implementations • 19 May 2018 • Zichuan Lin, Tianqi Zhao, Guangwen Yang, Lintao Zhang
Reinforcement learning (RL) algorithms have made huge progress in recent years by leveraging the power of deep neural networks (DNN).
no code implementations • 13 Apr 2018 • Riling Li, Bujiao Wu, Mingsheng Ying, Xiaoming Sun, Guangwen Yang
We design a large-scale simulator of universal random quantum circuits, often called 'quantum supremacy circuits', and implement it on Sunway TaihuLight.
Quantum Physics
no code implementations • 11 Dec 2017 • Hao Zhang, Shizhen Xu, Graham Neubig, Wei Dai, Qirong Ho, Guangwen Yang, Eric P. Xing
Recent deep learning (DL) models have moved beyond static network architectures to dynamic ones, handling data where the network structure changes every example, such as sequences of variable lengths, trees, and graphs.
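Why dynamic architectures complicate batching can be seen in a minimal sketch (illustrative, not the paper's system): the recurrence below unrolls a different number of steps per example, so the computation graph changes with every input and cannot be compiled once for the whole batch.

```python
def rnn_step(state, token, w=0.5):
    # Toy recurrent cell: new state from old state and input token.
    return w * state + token

def encode(sequence):
    state = 0.0
    for token in sequence:   # loop length varies per example
        state = rnn_step(state, token)
    return state

batch = [[1.0], [1.0, 2.0, 3.0]]   # variable-length sequences
print([encode(s) for s in batch])  # -> [1.0, 4.25]
```

Trees and graphs make the per-example structure even more irregular, which is why static-graph frameworks struggle with such workloads.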