Search Results for author: Aaron Archer

Found 1 paper, 0 papers with code

Practical Performance Guarantees for Pipelined DNN Inference

no code implementations • 7 Nov 2023 • Aaron Archer, Matthew Fahrbach, Kuikui Liu, Prakash Prabhu

We optimize pipeline parallelism for deep neural network (DNN) inference by partitioning model graphs into $k$ stages and minimizing the running time of the bottleneck stage, including communication.
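To make the objective in the abstract concrete, here is a minimal sketch of bottleneck-minimizing partitioning under a simplifying assumption: the model is a linear chain of layers rather than a general graph, and each stage is a contiguous run of layers. The function name `min_bottleneck_partition`, the inputs `work` and `comm`, and the choice to charge a cut edge's communication cost to both adjacent stages are illustrative assumptions, not details taken from the paper, and the dynamic program is a textbook technique rather than the authors' method.

```python
def min_bottleneck_partition(work, comm, k):
    """Split a chain of layers into k contiguous stages, minimizing the
    cost of the slowest (bottleneck) stage.

    work[i]: compute cost of layer i (hypothetical input).
    comm[i]: cost of moving layer i's output to layer i + 1 when the two
             layers land in different stages (hypothetical input).
    Assumption: a stage pays for both its incoming and outgoing transfer.
    Returns (bottleneck_cost, [(start, end), ...]) with end exclusive.
    """
    n = len(work)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= number of layers")
    if len(comm) != n - 1:
        raise ValueError("comm must have one entry per adjacent layer pair")

    # Prefix sums give the compute cost of any contiguous run in O(1).
    prefix = [0]
    for w in work:
        prefix.append(prefix[-1] + w)

    def stage_cost(i, j):
        """Cost of a stage holding layers i..j-1, including communication."""
        cost = prefix[j] - prefix[i]
        if i > 0:
            cost += comm[i - 1]  # receive from the previous stage
        if j < n:
            cost += comm[j - 1]  # send to the next stage
        return cost

    inf = float("inf")
    # best[s][j]: minimum bottleneck when the first j layers form s stages.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    cut = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0
    for s in range(1, k + 1):
        for j in range(s, n + 1):
            for i in range(s - 1, j):
                candidate = max(best[s - 1][i], stage_cost(i, j))
                if candidate < best[s][j]:
                    best[s][j] = candidate
                    cut[s][j] = i

    # Walk the recorded cut points backwards to recover the stages.
    stages = []
    j = n
    for s in range(k, 0, -1):
        i = cut[s][j]
        stages.append((i, j))
        j = i
    stages.reverse()
    return best[k][n], stages
```

Calling `min_bottleneck_partition([4, 2, 3, 5, 1], [1, 1, 1, 1], 2)` returns the bottleneck cost together with the half-open layer ranges of each stage. The sketch runs in $O(kn^2)$ time for $n$ layers; general model graphs need a different formulation.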