Search Results for author: Nikita Trukhanov

Found 1 papers, 0 papers with code

Accurate Block Quantization in LLMs with Outliers

no code implementations29 Mar 2024 Nikita Trukhanov, Ilya Soloveychik

It made evident the colossal shortage of dedicated hardware capable of efficient and fast processing of the involved compute and memory movement.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.