no code implementations • 8 Dec 2021 • Keyu Yang, Lu Chen, Zhihao Zeng, Yunjun Gao
Distributed ML models trained by SGD involve large amounts of gradient communication, which limits the scalability of distributed ML.
BIG-bench Machine Learning Quantization