Search Results for author: Qiyang Jiang

Found 1 papers, 1 papers with code

Benchmarking Data Science Agents

1 code implementation27 Feb 2024 Yuge Zhang, Qiyang Jiang, Xingyu Han, Nan Chen, Yuqing Yang, Kan Ren

In this paper, we introduce DSEval -- a novel evaluation paradigm, as well as a series of innovative benchmarks tailored for assessing the performance of these agents throughout the entire data science lifecycle.

Benchmarking Code Generation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.