no code implementations • WS 2019 • Maxim Kodryan, Artem Grachev, Dmitry Ignatov, Dmitry Vetrov
Reduction of the number of parameters is one of the most important goals in Deep Learning.
Decoder Language Modelling +1