Search Results for author: Zhonghan Zhao

Found 6 papers, 0 papers with code

Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model

no code implementations • 6 Apr 2024 • Zhonghan Zhao, Ke Ma, Wenhao Chai, Xuan Wang, Kewei Chen, Dongxu Guo, Yanting Zhang, Hongwei Wang, Gaoang Wang

After distillation, embodied agents can complete complex, open-ended tasks without additional expert guidance, utilizing the performance and knowledge of a versatile MLM.

Knowledge Distillation

Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

no code implementations • 13 Mar 2024 • Zhonghan Zhao, Kewei Chen, Dongxu Guo, Wenhao Chai, Tian Ye, Yanting Zhang, Gaoang Wang

To assess organizational behavior, we design a series of navigation tasks in the Minecraft environment, which includes searching and exploring.

Navigate

See and Think: Embodied Agent in Virtual Environment

no code implementations • 26 Nov 2023 • Zhonghan Zhao, Wenhao Chai, Xuan Wang, Li Boyi, Shengyu Hao, Shidong Cao, Tian Ye, Jenq-Neng Hwang, Gaoang Wang

Vision perception involves the interpretation of visual information in the environment, which is then integrated into the LLMs component with agent state and task instruction.

Question Answering, Retrieval

Devil in the Number: Towards Robust Multi-modality Data Filter

no code implementations • 24 Sep 2023 • Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang

In order to appropriately filter multi-modality data sets on a web-scale, it becomes crucial to employ suitable filtering methods to boost performance and reduce training costs.

UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning

no code implementations • 19 Aug 2023 • Meiqi Sun, Zhonghan Zhao, Wenhao Chai, Hanjun Luo, Shidong Cao, Yanting Zhang, Jenq-Neng Hwang, Gaoang Wang

Our proposed model takes support images and labels as prompt guidance for a query image.

Decoder, Few-Shot Learning +1

A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision

no code implementations • 7 Jul 2023 • Zhonghan Zhao, Wenhao Chai, Shengyu Hao, Wenhao Hu, Guanhong Wang, Shidong Cao, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

Deep learning has the potential to revolutionize sports performance, with applications ranging from perception and comprehension to decision.
