Search Results for author: Zengzhi Wang

Found 6 papers, 5 papers with code

Benchmarking Benchmark Leakage in Large Language Models

1 code implementation • 29 Apr 2024 • Ruijie Xu, Zengzhi Wang, Run-Ze Fan, PengFei Liu

By analyzing 31 LLMs in the context of mathematical reasoning, we reveal substantial instances of training-set and even test-set misuse, resulting in potentially unfair comparisons.

Benchmarking, Mathematical Reasoning

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

1 code implementation • 28 Dec 2023 • Zengzhi Wang, Rui Xia, PengFei Liu

Our meticulous data collection and processing efforts included a complex suite of preprocessing, prefiltering, language identification, cleaning, filtering, and deduplication, ensuring the high quality of our corpus.

Language Identification, Math

Ask Again, Then Fail: Large Language Models' Vacillations in Judgement

1 code implementation • 3 Oct 2023 • Qiming Xie, Zengzhi Wang, Yi Feng, Rui Xia

We observe that current conversational language models often waver in their judgements when faced with follow-up questions, even if the original judgement was correct.

Negation

MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis

1 code implementation • 29 Jun 2023 • Hongjie Cai, Nan Song, Zengzhi Wang, Qiming Xie, Qiankun Zhao, Ke Li, Siwei Wu, Shijie Liu, Jianfei Yu, Rui Xia

Aspect-based sentiment analysis is a long-standing research interest in the field of opinion mining, and in recent years, researchers have gradually shifted their focus from simple ABSA subtasks to end-to-end multi-element ABSA tasks.

Aspect-Based Sentiment Analysis, Opinion Mining
