no code implementations • 28 Nov 2023 • Daniel Barley, Holger Fröning
We report the effectiveness of activation pruning by evaluating training speed, accuracy, and memory usage of large-scale neural architectures on the example of ResMLP on image classification tasks.