Search Results for author: Feng Sun

Found 11 papers, 3 papers with code

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

1 code implementation • 28 Feb 2024 • Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs).

3,211

Paper
Code

HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

no code implementations • 24 Feb 2024 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Large language models (LLMs) have emerged as a promising alternative to expensive human evaluations.

Language Modelling Large Language Model

Paper
Add Code

Text Diffusion with Reinforced Conditioning

no code implementations • 19 Feb 2024 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio.

Paper
Add Code

Improving Domain Adaptation through Extended-Text Reading Comprehension

1 code implementation • 14 Jan 2024 • Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

To enhance the domain-specific capabilities of large language models, continued pre-training on a domain-specific corpus is a prevalent method.

Clustering Domain Adaptation +1

3,211

Paper
Code

Democratizing Reasoning Ability: Tailored Learning from Large Language Model

1 code implementation • 20 Oct 2023 • Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

In this paper, we propose a tailored learning approach to distill such reasoning ability to smaller LMs to facilitate the democratization of the exclusive reasoning ability.

Instruction Following Language Modelling +1

Paper
Code

Auto Search Indexer for End-to-End Document Retrieval

no code implementations • 19 Oct 2023 • Tianchi Yang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

Generative retrieval, which is a new advanced paradigm for document retrieval, has recently attracted research interests, since it encodes all documents into the model and directly generates the retrieved documents.

Decoder Retrieval

Paper
Add Code

Calibrating LLM-Based Evaluator

no code implementations • 23 Sep 2023 • Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Recent advancements in large language models (LLMs) on language modeling and emergent capabilities make them a promising reference-free evaluator of natural language generation quality, and a competent alternative to human evaluation.

In-Context Learning Language Modelling +1

Paper
Add Code

A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

no code implementations • 6 Jul 2022 • Feng Sun, Ming-Kun Xie, Sheng-Jun Huang

In this paper, we study the partial multi-label (PML) image classification problem, where each image is annotated with a candidate label set consists of multiple relevant labels and other noisy labels.

Multi-Label Image Classification

Paper
Add Code

MMINR: Multi-frame-to-Multi-frame Inference with Noise Resistance for Precipitation Nowcasting with Radar

no code implementations • 5 May 2022 • Feng Sun, Cong Bai

To address this problem, we propose a novel Multi-frame-to-Multi-frame Inference (MMI) model with Noise Resistance (NR) named MMINR.

Paper
Add Code

Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting

no code implementations • IEEE Geoscience and Remote Sensing Letters 2022 • Cong Bai, Feng Sun, Jinglin Zhang, Yi Song, ShengYong Chen

The experimental results show that Rainformer outperforms seven state of the arts methods on the benchmark database and provides more insights into the high-intensity rainfall prediction task.

Ranked #5 on Weather Forecasting on SEVIR

Weather Forecasting

Paper
Add Code

RRLFSOR: An Efficient Self-Supervised Learning Strategy of Graph Convolutional Networks

no code implementations • 17 Aug 2021 • Feng Sun, Ajith Kumar V, Guanci Yang, Qikui Zhu, Yiyun Zhang, Ansi Zhang, Dhruv Makwana

Graph Convolutional Networks (GCNs) are widely used in many applications yet still need large amounts of labelled data for training.

Link Prediction Self-Learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.