Search Results for author: Peerat Limkonchotiwat

Found 10 papers, 9 papers with code

Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble

1 code implementation • EMNLP 2020 • Peerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong

Like many Natural Language Processing tasks, Thai word segmentation is domain-dependent.

Ranked #1 on Thai Word Segmentation on WS160 (using extra training data)

Domain Adaptation Ensemble Learning +2

Paper
Code

Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation

1 code implementation • Findings (ACL) 2021 • Peerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong

Thai Word Segmentation

Paper
Code

Robust Fragment-Based Framework for Cross-lingual Sentence Retrieval

no code implementations • Findings (EMNLP) 2021 • Nattapol Trijakwanich, Peerat Limkonchotiwat, Raheem Sarwar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong

Cross-lingual Sentence Retrieval (CLSR) aims at retrieving parallel sentence pairs that are translations of each other from a multilingual set of comparable documents.

Machine Translation Retrieval +4

Paper
Add Code

Thai Nested Named Entity Recognition Corpus

1 code implementation • Findings (ACL) 2022 • Weerayut Buaphet, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Attapol Rutherford, Sarana Nutanong

Our work, to the best of our knowledge, presents the largest non-English N-NER dataset and the first non-English one with fine-grained classes.

Language Modelling named-entity-recognition +3

Paper
Code

CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering

1 code implementation • Findings (NAACL) 2022 • Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong

A common approach to CL-ReQA is to create a multilingual sentence embedding space such that question-answer pairs across different languages are close to each other.

Language Modelling Question Answering +6