Search Results for author: Koki Shibata

Found 1 papers, 1 papers with code

How do different tokenizers perform on downstream tasks in scriptio continua languages?: A case study in Japanese

1 code implementation16 Jun 2023 Takuro Fujii, Koki Shibata, Atsuki Yamaguchi, Terufumi Morishita, Yasuhiro Sogawa

This paper investigates the effect of tokenizers on the downstream performance of pretrained language models (PLMs) in scriptio continua languages where no explicit spaces exist between words, using Japanese as a case study.

Cannot find the paper you are looking for? You can Submit a new open access paper.