Search Results for author: Khiem Le

Found 2 papers, 1 paper with code

Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization

no code implementations • 22 Mar 2024 • Khiem Le, Long Ho, Cuong Do, Danh Le-Phuoc, Kok-Seng Wong

Domain shift is a formidable issue in Machine Learning that causes a model to suffer from performance degradation when tested on unseen domains.

Domain Generalization, Federated Learning, +1

HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

1 code implementation • 12 Dec 2023 • Giang Do, Khiem Le, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Bint T. Nguyen, Chenghao Liu, Savitha Ramasamy, XiaoLi Li, Steven Hoi

By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models.
