no code implementations • 14 Apr 2023 • Julian Burghoff, Marc Heinrich Monells, Hanno Gottschalk
The highly structured energy landscape of the loss as a function of parameters for deep neural networks makes it necessary to use sophisticated optimization strategies in order to discover (local) minima that guarantee reasonable performance.