Search Results for author: Tianle Zhong

Found 2 papers, 1 paper with code

RINAS: Training with Dataset Shuffling Can Be General and Fast

No code implementations • 4 Dec 2023 • Tianle Zhong, Jiechen Zhao, Xindi Guo, Qiang Su, Geoffrey Fox

However, loading shuffled data for large datasets incurs significant overhead in the deep learning pipeline and severely impacts the end-to-end training throughput.

Language Modelling

RTP: Rethinking Tensor Parallelism with Memory Deduplication

1 code implementation • 2 Nov 2023 • Cheng Luo, Tianle Zhong, Geoffrey Fox

In the evolving landscape of neural network models, one prominent challenge stands out: the significant memory overheads associated with training expansive models.
