no code implementations • 6 Nov 2023 • Zhipeng Yao, Yu Zhang, Dazhou Li
To address this contradiction, we propose a novel optimization method that aims to accelerate the convergence rate of SGD without loss of generalization.