Search Results for author: Maximilian Schlegel

Found 1 papers, 0 papers with code

Uncovering mesa-optimization algorithms in Transformers

no code implementations • 11 Sep 2023 • Johannes von Oswald, Eyvind Niklasson, Maximilian Schlegel, Seijin Kobayashi, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

Transformers have become the dominant model in deep learning, but the reason for their superior performance is poorly understood.

In-Context Learning Language Modelling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.