3 Oct 2023 • Jack W. Lindsey, Samuel Lippl
Our findings hold qualitatively for a deep architecture trained on image classification tasks, and our characterization of the nested feature selection regime motivates a modification to pretraining plus fine-tuning (PT+FT) that we find empirically improves performance.