Search Results for author: Sijia Cheng

Found 2 papers, 0 papers with code

A Survey of Useful LLM Evaluation

no code implementations3 Jun 2024 Ji-Lun Peng, Sijia Cheng, Egil Diau, Yung-Yu Shih, Po-Heng Chen, Yen-Ting Lin, Yun-Nung Chen

We proposed the two-stage framework: from ``core ability'' to ``agent'', clearly explaining how LLMs can be applied based on their specific capabilities, along with the evaluation methods in each stage.

Measuring Taiwanese Mandarin Language Understanding

no code implementations29 Mar 2024 Po-Heng Chen, Sijia Cheng, Wei-Lin Chen, Yen-Ting Lin, Yun-Nung Chen

We present TMLU, a holistic evaluation suit tailored for assessing the advanced knowledge and reasoning capability in LLMs, under the context of Taiwanese Mandarin.

Cannot find the paper you are looking for? You can Submit a new open access paper.