21 Nov 2022 • Jean Pachebat, Sergei Ivanov
Unlike neural networks with millions of trainable parameters, gradient-boosted decision trees (GBDTs) optimize the loss function in an additive manner and have a single trainable parameter per leaf, which makes it easy to apply high-order optimization to the loss function.
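
To illustrate why a single parameter per leaf makes higher-order updates cheap, here is a minimal sketch of a second-order (Newton) update for one leaf under logistic loss. It is not the authors' method, only the familiar second-order case; the function name `newton_leaf_value`, the `reg_lambda` parameter, and the toy data are assumptions made for this example.

```python
import numpy as np

def newton_leaf_value(y, pred, reg_lambda=1.0):
    """One Newton step for the single scalar value of a leaf (logistic loss).

    y    : 0/1 labels of the samples that fall into the leaf
    pred : current raw scores (log-odds) of the ensemble for those samples
    """
    p = 1.0 / (1.0 + np.exp(-pred))   # predicted probabilities
    grad = p - y                      # first derivative of the loss w.r.t. the score
    hess = p * (1.0 - p)              # second derivative of the loss w.r.t. the score
    # Because the leaf has one parameter, the Newton step is a scalar division.
    return -grad.sum() / (hess.sum() + reg_lambda)

def logistic_loss(y, pred):
    """Summed logistic loss for raw scores `pred`."""
    return np.sum(np.log1p(np.exp(-pred)) + (1.0 - y) * pred)

# Hypothetical toy leaf: four samples, ensemble currently predicts score 0.
y = np.array([1.0, 1.0, 0.0, 1.0])
pred = np.zeros(4)

w = newton_leaf_value(y, pred, reg_lambda=0.0)
loss_before = logistic_loss(y, pred)
loss_after = logistic_loss(y, pred + w)
```

Since the update reduces to sums of per-sample derivatives over the leaf, moving to an order beyond two would only require additional derivative sums rather than a full Hessian over millions of parameters.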