Search Results for author: Longwei Zou

Found 2 papers, 1 papers with code

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers

no code implementations10 Apr 2024 Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng

The fast-growing large scale language models are delivering unprecedented performance on almost all natural language processing tasks.

Quantization

A Multi-Level Framework for Accelerating Training Transformer Models

1 code implementation7 Apr 2024 Longwei Zou, Han Zhang, Yangdong Deng

Specifically, the framework is based on three basic operators, Coalescing, De-coalescing and Interpolation, which can be orchestrated to build a multi-level training framework.

Cannot find the paper you are looking for? You can Submit a new open access paper.