7 Nov 2023 • Aaron Archer, Matthew Fahrbach, Kuikui Liu, Prakash Prabhu
We optimize pipeline parallelism for deep neural network (DNN) inference by partitioning model graphs into $k$ stages and minimizing the running time of the bottleneck stage, including communication.
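The bottleneck-minimization objective can be illustrated with a simplified sketch: splitting a linear chain of layers with known per-layer costs into $k$ contiguous stages so that the most expensive stage is as cheap as possible. This is an assumption-laden toy version (the paper addresses general model graphs and includes communication costs, both omitted here); the function name and cost values are illustrative.

```python
# Toy sketch of the bottleneck objective: partition a chain of layer
# costs into at most k contiguous stages, minimizing the max stage cost.
# Simplified illustration only -- ignores communication and general
# graph structure treated in the paper.

def min_bottleneck_partition(costs, k):
    """Smallest achievable bottleneck (max stage cost) for a chain."""
    def stages_needed(cap):
        # Greedily pack layers into stages without exceeding `cap`.
        stages, cur = 1, 0
        for c in costs:
            if c > cap:
                return float("inf")  # a single layer exceeds the cap
            if cur + c > cap:
                stages += 1
                cur = c
            else:
                cur += c
        return stages

    # Binary search over the bottleneck value (integer costs assumed).
    lo, hi = max(costs), sum(costs)
    while lo < hi:
        mid = (lo + hi) // 2
        if stages_needed(mid) <= k:
            hi = mid
        else:
            lo = mid + 1
    return lo

# Example: six layers split into 3 stages -> bottleneck 8
# (stages [4,2], [7,1], [5,3]).
print(min_bottleneck_partition([4, 2, 7, 1, 5, 3], 3))  # -> 8
```

The greedy feasibility check is monotone in the cap, which is what makes the binary search correct; the paper's setting replaces this chain structure with general DNN graphs.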