no code implementations • 22 Feb 2021 • Usama Khalid, Mirza Omer Beg, Muhammad Umair Arshad
We train Monolingual, Multilingual, and Bilingual models of Roman Urdu - the proposed bilingual model achieves 23% accuracy compared to the 2% and 11% of the monolingual and multilingual models respectively in the Masked Language Modeling (MLM) task.
1 code implementation • 22 Feb 2021 • Usama Khalid, Aizaz Hussain, Muhammad Umair Arshad, Waseem Shahzad, Mirza Omer Beg
In this paper, we have built a corpus for Urdu by scraping and integrating data from various sources and compiled a vocabulary for the Urdu language.
no code implementations • 22 Feb 2021 • Usama Khalid, Mirza Omer Beg
Information verification is quite a challenging task, this is because many times verifying a claim can require picking pieces of information from multiple pieces of evidence which can have a hierarchy of complex semantic relations.
no code implementations • 22 Feb 2021 • Usama Khalid, Mirza Omer Beg, Muhammad Umair Arshad
It is also a well-known fact that training and maintaining monolingual models for each language is a costly and time-consuming process.