no code implementations • 8 Feb 2024 • Jonathan Thomm, Aleksandar Terzic, Giacomo Camposampiero, Michael Hersche, Bernhard Schölkopf, Abbas Rahimi
We analyze the capabilities of Transformer language models in learning compositional discrete tasks.
no code implementations • 9 Dec 2023 • Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi
We build on their approach by replacing the linear recurrence with a specialized temporal convolutional network that permits a larger receptive field with shallower networks and reduces the computational complexity to $O(L)$.
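The paper's exact architecture is not reproduced here; the following is a minimal NumPy sketch of the generic mechanism by which a dilated temporal convolutional network trades depth for receptive field while keeping the per-layer cost linear in the sequence length. All names and sizes are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def causal_dilated_conv1d(x, w, dilation):
    """Causal 1-D convolution with dilation d: the output at step t
    depends only on x[t], x[t-d], x[t-2d], ... (never on future inputs)."""
    y = np.zeros(len(x))
    for t in range(len(x)):
        for i, wi in enumerate(w):
            tap = t - i * dilation
            if tap >= 0:  # taps before the start of the sequence contribute nothing
                y[t] += wi * x[tap]
    return y

def receptive_field(kernel_size, dilations):
    """Receptive field of a stack of causal dilated layers:
    1 + sum over layers of (kernel_size - 1) * dilation."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)
```

With kernel size 2 and dilations doubling per layer (1, 2, 4, 8), four layers already cover 16 time steps, and each layer costs $O(L)$ for a sequence of length $L$ (the kernel size is a constant), so the receptive field grows exponentially with depth while the total cost stays linear in $L$.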
no code implementations • 24 Mar 2023 • Michael Hersche, Aleksandar Terzic, Geethan Karunaratne, Jovin Langenegger, Angéline Pouget, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi
Secondly, the proposed factorizer maintains a high accuracy when queried by noisy product vectors generated using deep convolutional neural networks (CNNs).
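To make the setting concrete, here is a minimal sketch of the problem the factorizer solves: two bipolar codebook vectors are bound into a product vector by elementwise multiplication, the query is corrupted with noise (standing in for a product vector produced by a CNN front end), and the factors are recovered by similarity search. The exhaustive search below is an assumption for illustration; the paper's factorizer avoids enumerating all candidate pairs, but the maximum-similarity decoding criterion is the same. All sizes and names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
D, M = 1024, 8  # illustrative vector dimension and codebook entries per factor

# Two random bipolar codebooks; a product vector binds one entry from each
# by elementwise (Hadamard) multiplication.
X = rng.choice([-1, 1], size=(M, D))
Y = rng.choice([-1, 1], size=(M, D))
ix, iy = 3, 5
s = X[ix] * Y[iy]

# Simulate a noisy query by flipping ~20% of the components.
flip = rng.random(D) < 0.2
s_noisy = np.where(flip, -s, s)

def factorize(s, X, Y):
    """Recover the codebook indices whose bound pair best matches s,
    by exhaustive dot-product similarity search over all M*M pairs."""
    scores = np.array([[s @ (X[i] * Y[j]) for j in range(len(Y))]
                       for i in range(len(X))])
    i, j = np.unravel_index(np.argmax(scores), scores.shape)
    return int(i), int(j)
```

Because high-dimensional random bipolar vectors are quasi-orthogonal, the correct pair scores about $(1 - 2p)D$ under a component-flip rate $p$, while incorrect pairs score near zero, so moderate noise leaves the factorization intact.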