no code implementations • 18 Oct 2019 • Scott Sievert, Shrey Shah
This work presents a method to adapt the batch size to the model's training loss.
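The one-sentence summary above can be illustrated with a minimal sketch. The rule below (batch size growing inversely with the loss, capped at a maximum) is a hypothetical example of loss-adaptive batching; the function name, the exact growth rule, and the cap are assumptions for illustration, not the paper's method.

```python
import math

def adaptive_batch_size(initial_bs, initial_loss, current_loss, max_bs=1024):
    """Grow the batch size inversely with the current training loss.

    Illustrative rule only: as the loss shrinks, gradient noise becomes
    relatively larger, so a bigger batch is used to reduce it.
    """
    if current_loss <= 0:
        return max_bs
    bs = math.ceil(initial_bs * initial_loss / current_loss)
    return min(bs, max_bs)

# Halving the loss doubles the batch size under this rule.
print(adaptive_batch_size(32, 2.0, 1.0))  # 64
```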
1 code implementation • NeurIPS 2018 • Hongyi Wang, Scott Sievert, Zachary Charles, Shengchao Liu, Stephen Wright, Dimitris Papailiopoulos
We present ATOMO, a general framework for atomic sparsification of stochastic gradients.
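A core idea behind atomic sparsification is to drop "atoms" of the gradient randomly but rescale the survivors so the result is unbiased in expectation. The sketch below uses individual coordinates as the atoms, which is one simple instance of an atomic decomposition; the function name and per-entry probabilities are illustrative assumptions, not the ATOMO implementation.

```python
import random

def sparsify_unbiased(grad, probs):
    """Entrywise unbiased sparsification.

    Keep grad[i] with probability probs[i] and rescale it by 1/probs[i];
    dropped entries become 0. The expectation of the output equals grad.
    Coordinates-as-atoms is just one example of an atomic decomposition.
    """
    out = []
    for g, p in zip(grad, probs):
        if random.random() < p:
            out.append(g / p)  # rescale so E[out[i]] = grad[i]
        else:
            out.append(0.0)
    return out
```

With all probabilities set to 1.0 the gradient passes through unchanged; lowering a probability sparsifies that entry more aggressively while keeping the estimator unbiased.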