no code implementations • 2 May 2024 • Daniel Coquelin, Katherina Flügel, Marie Weiel, Nicholas Kiefer, Muhammed Öz, Charlotte Debus, Achim Streit, Markus Götz
Communication bottlenecks hinder the scalability of distributed neural network training, particularly on distributed-memory computing clusters.