Search Results for author: Hanjin Choi

Found 1 papers, 0 papers with code

Failure Tolerant Training with Persistent Memory Disaggregation over CXL

no code implementations • 14 Jan 2023 • Miryeong Kwon, Junhyeok Jang, Hanjin Choi, Sangwon Lee, Myoungsoo Jung

This paper proposes TRAININGCXL that can efficiently process large-scale recommendation datasets in the pool of disaggregated memory while making training fault tolerant with low overhead.

Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.