Search Results for author: Longxu Dou

Found 16 papers, 7 papers with code

Sailor: Open Language Models for South-East Asia

1 code implementation • 4 Apr 2024 • Longxu Dou, Qian Liu, Guangtao Zeng, Jia Guo, Jiahui Zhou, Wei Lu, Min Lin

We present Sailor, a family of open language models ranging from 0. 5B to 7B parameters, tailored for South-East Asian (SEA) languages.

Language Modelling Question Answering +1

Paper
Code

Multi-Hop Table Retrieval for Open-Domain Text-to-SQL

no code implementations • 16 Feb 2024 • Xuanliang Zhang, Dingzirui Wang, Longxu Dou, Qingfu Zhu, Wanxiang Che

To reduce the effect of the similar irrelevant entity, our method focuses on unretrieved entities at each hop and considers the low-ranked tables by beam search.

Table Retrieval Text-To-SQL

Paper
Add Code

Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes

no code implementations • 16 Feb 2024 • Dingzirui Wang, Longxu Dou, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che

Numerical reasoning is an essential ability for NLP systems to handle numeric information.

Paper
Add Code

Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL

no code implementations • 16 Feb 2024 • Dingzirui Wang, Longxu Dou, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che

Currently, the in-context learning method based on large language models (LLMs) has become the mainstream of text-to-SQL research.

In-Context Learning Text-To-SQL

Paper
Add Code

A Survey of Table Reasoning with Large Language Models

1 code implementation • 13 Feb 2024 • Xuanliang Zhang, Dingzirui Wang, Longxu Dou, Qingfu Zhu, Wanxiang Che

In this paper, we analyze the mainstream techniques used to improve table reasoning performance in the LLM era, and the advantages of LLMs compared to pre-LLMs for solving table reasoning.

Paper
Code

Exploring Equation as a Better Intermediate Meaning Representation for Numerical Reasoning

1 code implementation • 21 Aug 2023 • Dingzirui Wang, Longxu Dou, Wenbin Zhang, Junyu Zeng, Wanxiang Che

So in this paper, we try to use equations as IMRs to solve the numerical reasoning task by addressing two problems: (1) Theoretically, how to prove that the equation is an IMR with higher generation accuracy than programs; (2) Empirically, how to improve the generation accuracy of equations with LLMs.

GSM8K

Paper
Code

Controllable Data Augmentation for Context-Dependent Text-to-SQL

no code implementations • 27 Apr 2023 • Dingzirui Wang, Longxu Dou, Wanxiang Che

In this paper, we introduce ConDA, which generates interactive questions and corresponding SQL results.

Data Augmentation Text-To-SQL

Paper
Add Code

MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning

no code implementations • 19 Apr 2023 • Bohan Li, Longxu Dou, Yutai Hou, Yunlong Feng, Honglin Mu, Qingfu Zhu, Qinghua Sun, Wanxiang Che

Prompt-based learning has shown considerable promise in reformulating various downstream tasks as cloze problems by combining original input with a predetermined template.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning

1 code implementation • 17 Apr 2023 • Qian Liu, Fan Zhou, Zhengbao Jiang, Longxu Dou, Min Lin

Empirical results on various benchmarks validate that the integration of SQL execution leads to significant improvements in zero-shot scenarios, particularly in table reasoning.

Zero-shot Generalization

Paper
Code

Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge

1 code implementation • 3 Jan 2023 • Longxu Dou, Yan Gao, Xuqi Liu, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Min-Yen Kan, Jian-Guang Lou

In this paper, we study the problem of knowledge-intensive text-to-SQL, in which domain knowledge is necessary to parse expert questions into SQL queries over domain-specific tables.

Semantic Parsing Text-To-SQL

361

Paper
Code

A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and Future Directions

no code implementations • 27 Dec 2022 • Dingzirui Wang, Longxu Dou, Wanxiang Che

Table-and-text hybrid question answering (HybridQA) is a widely used and challenging NLP task commonly applied in the financial and scientific domain.

Question Answering

Paper
Add Code

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

1 code implementation • 27 Dec 2022 • Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou

Text-to-SQL semantic parsing is an important NLP task, which greatly facilitates the interaction between users and the database and becomes the key component in many human-computer interaction systems.

Benchmarking Semantic Parsing +1

361

Paper
Code

UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL

1 code implementation • 15 Mar 2022 • Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou

Existing text-to-SQL semantic parsers are typically designed for particular settings such as handling queries that span multiple tables, domains or turns which makes them ineffective when applied to different settings.

Language Modelling Text-To-SQL

361

Paper
Code

HIT-SCIR at MRP 2020: Transition-based Parser and Iterative Inference Parser

no code implementations • CONLL 2020 • Longxu Dou, Yunlong Feng, Yuqiu Ji, Wanxiang Che, Ting Liu

This paper describes our submission system (HIT-SCIR) for the CoNLL 2020 shared task: Cross-Framework and Cross-Lingual Meaning Representation Parsing.