Search Results for author: Haozheng Fan

Found 3 papers, 1 papers with code

HLAT: High-quality Large Language Model Pre-trained on AWS Trainium

no code implementations • 16 Apr 2024 • Haozheng Fan, Hao Zhou, Guangtai Huang, Parameswaran Raman, Xinwei Fu, Gaurav Gupta, Dhananjay Ram, Yida Wang, Jun Huan

In this paper, we showcase HLAT: a 7 billion parameter decoder-only LLM pre-trained using trn1 instances over 1. 8 trillion tokens.

Language Modelling Large Language Model

Paper
Add Code

RAF: Holistic Compilation for Deep Learning Model Training

1 code implementation • 8 Mar 2023 • Cody Hao Yu, Haozheng Fan, Guangtai Huang, Zhen Jia, Yizhi Liu, Jie Wang, Zach Zheng, Yuan Zhou, Haichen Shen, Junru Shao, Mu Li, Yida Wang

In this paper, we present RAF, a deep learning compiler for training.

Graph Generation

135

Paper
Code

Effective Decoding in Graph Auto-Encoder using Triadic Closure

no code implementations • 26 Nov 2019 • Han Shi, Haozheng Fan, James T. Kwok

We propose the triad decoder, which considers and predicts the three edges involved in a local triad together.

Clustering Decoder +5

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.