no code implementations • 30 Jan 2023 • Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev, Ogabek Sobirov
This research paper presents a part-of-speech (POS) annotated dataset and tagger tool for the low-resource Uzbek language.
no code implementations • 28 Oct 2022 • Maksud Sharipov, Ogabek Sobirov
The lemmatization approach combines general rules and part-of-speech data for the Uzbek language with a list of affixes and their classification. Affixes are removed by a finite state machine defined for each class, and the remaining form is taken as the word's lemma.
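The listing reports no code for this paper, so the following is only a minimal sketch of affix removal with a finite state machine, not the authors' implementation. It assumes a single class (nouns), a small hand-picked subset of Uzbek suffixes, and a minimum stem length. The names `NOUN_FSM`, `MIN_STEM_LEN`, and `strip_noun_affixes` are hypothetical.

```python
# Illustrative sketch only; not the method published in the paper.
# Assumption: Uzbek nouns follow the template
#   stem + plural + possessive + case
# so suffixes are removed right to left: case, then possessive, then plural.

MIN_STEM_LEN = 3  # assumed guard against over-stripping short words

# A linear finite state machine for the noun class. Each state may remove
# one affix from its list, or pass the word on unchanged.
# The affix lists are a deliberately small subset.
NOUN_FSM = [
    ("case", ["ning", "dan", "ni", "ga", "da"]),
    ("possessive", ["ingiz", "imiz", "ing", "im"]),
    ("plural", ["lar"]),
]


def strip_noun_affixes(word):
    """Return (candidate_lemma, removed), where removed lists (state, affix) pairs."""
    stem = word.lower()
    removed = []
    for state, affixes in NOUN_FSM:
        # Try longer affixes first so "ning" is preferred over "ni".
        for affix in sorted(affixes, key=len, reverse=True):
            if stem.endswith(affix) and len(stem) - len(affix) >= MIN_STEM_LEN:
                stem = stem[: -len(affix)]
                removed.append((state, affix))
                break  # at most one affix per state
    return stem, removed


if __name__ == "__main__":
    for w in ["kitoblarimni", "maktabda", "shim"]:
        print(w, strip_noun_affixes(w))
```

For example, `kitoblarimni` is reduced to `kitob` by removing `ni`, `im`, and `lar` in turn, while `shim` is left intact because stripping `im` would leave a stem shorter than the assumed minimum. A fuller system would need one such machine per part-of-speech class and a lexicon check to confirm the candidate lemma.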