1 code implementation • 24 Oct 2022 • Benoit Steiner, Mostafa Elhoushi, Jacob Kahn, James Hegarty
We present OLLA, an algorithm that optimizes the lifetime and memory location of the tensors used to train neural networks.
no code implementations • 27 Aug 2021 • Shikhar Singh, Benoit Steiner, James Hegarty, Hugh Leather
State-of-the-art deep-learning compilers like TVM and Halide incorporate a learning-based performance model to search the space of valid implementations of a given deep learning algorithm.