Search Results for author: Jinkyu Yim

Found 2 papers, 1 papers with code

Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters

1 code implementation • 28 May 2024 • Jinkyu Yim, Jaeyong Song, Yerim Choi, Jaebeen Lee, Jaewon Jung, Hongsun Jang, Jinho Lee

In addition, they often fail to consider the memory requirement per GPU, often recommending solutions that could not be executed.

Language Modelling Large Language Model

Paper
Code

Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression

no code implementations • 24 Jan 2023 • Jaeyong Song, Jinkyu Yim, Jaewon Jung, Hongsun Jang, Hyung-Jin Kim, Youngsok Kim, Jinho Lee

Compressing the communication is one way to mitigate the overhead by reducing the inter-node traffic volume; however, the existing compression techniques have critical limitations to be applied for NLP models with 3D parallelism in that 1) only the data parallelism traffic is targeted, and 2) the existing compression schemes already harm the model quality too much.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.