1 code implementation • 11 Dec 2023 • Henry Hengyuan Zhao, Pan Zhou, Mike Zheng Shou
Multimodal Large Language Models (MLLMs) demonstrate exceptional problem-solving capabilities, but little research has examined their ability to generate data by converting unlabeled images into visual instruction tuning data.
2 code implementations • 15 Sep 2023 • Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou
Experiments on 19 visual transfer learning downstream tasks demonstrate that our SCT outperforms full fine-tuning on 18 out of 19 tasks while adding only 0.11M parameters to ViT-B, 780$\times$ fewer than its full fine-tuning counterpart.