Search Results for author: Marcelo Gennari do Nascimento

Found 3 papers, 2 papers with code

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

1 code implementation • 26 Jan 2024 • Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

Large language models have become the cornerstone of natural language processing, but their use comes with substantial costs in terms of compute and memory resources.

Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes

1 code implementation • ECCV 2020 • Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu

We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN.

Gaussian Processes, Neural Architecture Search

DSConv: Efficient Convolution Operator

no code implementations • ICCV 2019 • Marcelo Gennari do Nascimento, Roger Fawcett, Victor Adrian Prisacariu

Quantization is a popular way of increasing the speed and lowering the memory usage of Convolutional Neural Networks (CNNs).

Quantization
