Search Results for author: Marcelo Gennari do Nascimento

Found 3 papers, 2 papers with code

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

1 code implementation 26 Jan 2024 Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

Large language models have become the cornerstone of natural language processing, but their use comes with substantial costs in terms of compute and memory resources.

Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes

1 code implementation ECCV 2020 Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu

We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN.

Gaussian Processes, Neural Architecture Search (+1 more)

DSConv: Efficient Convolution Operator

no code implementations ICCV 2019 Marcelo Gennari do Nascimento, Roger Fawcett, Victor Adrian Prisacariu

Quantization is a popular way of increasing the speed and lowering the memory usage of Convolutional Neural Networks (CNNs).

Quantization
