Search Results for author: Sijia Cheng

Found 2 papers, 0 papers with code

A Survey of Useful LLM Evaluation

no code implementations • 3 Jun 2024 • Ji-Lun Peng, Sijia Cheng, Egil Diau, Yung-Yu Shih, Po-Heng Chen, Yen-Ting Lin, Yun-Nung Chen

We proposed the two-stage framework: from ``core ability'' to ``agent'', clearly explaining how LLMs can be applied based on their specific capabilities, along with the evaluation methods in each stage.

Paper
Add Code

Measuring Taiwanese Mandarin Language Understanding

no code implementations • 29 Mar 2024 • Po-Heng Chen, Sijia Cheng, Wei-Lin Chen, Yen-Ting Lin, Yun-Nung Chen

We present TMLU, a holistic evaluation suit tailored for assessing the advanced knowledge and reasoning capability in LLMs, under the context of Taiwanese Mandarin.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.