Search Results for author: Luke Walters

Found 2 papers, 1 paper with code

KrADagrad: Kronecker Approximation-Domination Gradient Preconditioned Stochastic Optimization

1 code implementation • 30 May 2023 • Jonathan Mei, Alexander Moreno, Luke Walters

Second order stochastic optimizers allow parameter update step size and direction to adapt to loss curvature, but have traditionally required too much memory and compute for deep learning.

Stochastic Optimization

SKI to go Faster: Accelerating Toeplitz Neural Networks via Asymmetric Kernels

no code implementations • 15 May 2023 • Alexander Moreno, Jonathan Mei, Luke Walters

For the low rank component, we replace the RPE MLP with linear interpolation and use asymmetric Structured Kernel Interpolation (SKI) (Wilson et.
