Search Results for author: Yicun Duan

Found 1 papers, 0 papers with code

Succinct Compression: Near-Optimal and Lossless Compression of Deep Neural Networks during Inference Runtime

no code implementations29 Sep 2021 Yicun Duan, Xiangjun Peng

However, those techniques do not keep the compressed representation during inference runtime, which incurs significant overheads in terms of both performance and space consumption.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.