1 code implementation • 30 Apr 2024 • Goya van Boven, Yupei Du, Dong Nguyen
We further show that CDA remains effective in low-resource settings, in which a limited set of debiasing documents is used.
no code implementations • 10 Apr 2024 • Anna Wegmann, Tijs van den Broek, Dong Nguyen
We introduce a dataset with utterance pairs from NPR and CNN news interviews annotated for context-dependent paraphrases.
no code implementations • 2 Apr 2024 • Qixiang Fang, Daniel L. Oberski, Dong Nguyen
Third, we release 4 datasets to support measuring and comparing LLM proficiency in grade school mathematics and science against human populations.
no code implementations • 19 Dec 2023 • Qixiang Fang, Zhihan Zhou, Francesco Barbieri, Yozen Liu, Leonardo Neves, Dong Nguyen, Daniel L. Oberski, Maarten W. Bos, Ron Dotsch
Using this new framework, we design a Transformer-based user model that can produce high-quality general-purpose user representations for instant messaging platforms like Snapchat.
1 code implementation • 10 Oct 2023 • Yupei Du, Albert Gatt, Dong Nguyen
Dataset cartography is a simple yet effective dual-model approach that improves the robustness of fine-tuned PLMs.
1 code implementation • 27 Feb 2023 • Christian Fang, Qixiang Fang, Dong Nguyen
We describe our experiments for SemEval-2023 Task 4 on the identification of human values behind arguments (ValueEval).
1 code implementation • 15 Feb 2023 • Yupei Du, Dong Nguyen
Fine-tuning pre-trained language models on downstream tasks with varying random seeds has been shown to be unstable, especially on small datasets.
no code implementations • 8 Aug 2022 • Iman Munire Bilal, Bo wang, Adam Tsakalidis, Dong Nguyen, Rob Procter, Maria Liakata
We introduce the task of microblog opinion summarisation (MOS) and share a dataset of 3100 gold-standard opinion summaries to facilitate research in this domain.
1 code implementation • RepL4NLP (ACL) 2022 • Anna Wegmann, Marijn Schraagen, Dong Nguyen
Linguistic style is an integral component of language.
1 code implementation • 18 Feb 2022 • Qixiang Fang, Dong Nguyen, Daniel L Oberski
Our results thus highlight the necessity to examine the construct validity of text embeddings before deploying them in social science research.
2 code implementations • 1 Jan 2022 • Thuy T. Do, Du Nguyen, Anh Le, Anh Nguyen, Dong Nguyen, Nga Hoang, Uyen Le, Tuan Tran
This paper studies the reactions of social network users on the recommendation of using HCQ for COVID-19 treatment by analyzing the reaction patterns and sentiment of the tweets.
1 code implementation • EMNLP 2021 • Anna Wegmann, Dong Nguyen
Style is an integral part of natural language.
1 code implementation • EMNLP 2021 • Yupei Du, Qixiang Fang, Dong Nguyen
In this paper, we assess three types of reliability of word embedding gender bias measures, namely test-retest reliability, inter-rater consistency and internal consistency.
no code implementations • NAACL 2021 • Dong Nguyen, Laura Rosseel, Jack Grieve
The field of NLP has made substantial progress in building meaning representations.
1 code implementation • NAACL 2021 • Bertie Vidgen, Dong Nguyen, Helen Margetts, Patricia Rossini, Rebekah Tromble
Online abuse can inflict harm on users and communities, making online spaces unsafe and toxic.
1 code implementation • 3 May 2021 • Alexander Robertson, Farhana Ferdousi Liza, Dong Nguyen, Barbara McGillivray, Scott A. Hale
The semantics of emoji has, to date, been considered from a static perspective.
3 code implementations • ACL 2021 • Paul Röttger, Bertram Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts, Janet B. Pierrehumbert
Detecting online hate is a difficult task that even state-of-the-art models struggle with.
1 code implementation • COLING 2020 • Dong Nguyen, Jack Grieve
Analyses of word embeddings have primarily focused on semantic and syntactic properties.
no code implementations • 22 Jul 2020 • Nicole Peinelt, Dong Nguyen, Maria Liakata
Question paraphrase identification is a key task in Community Question Answering (CQA) to determine if an incoming question has been previously asked.
1 code implementation • ACL 2020 • Nicole Peinelt, Dong Nguyen, Maria Liakata
Semantic similarity detection is a fundamental task in natural language understanding.
no code implementations • IJCNLP 2019 • Philippa Shoemark, Farhana Ferdousi Liza, Dong Nguyen, Scott Hale, Barbara McGillivray
Word embeddings are increasingly used for the automatic detection of semantic change; yet, a robust evaluation and systematic comparison of the choices involved has been lacking.
1 code implementation • WS 2019 • Bertie Vidgen, Alex Harris, Dong Nguyen, Rebekah Tromble, Scott Hale, Helen Margetts
Online abusive content detection is an inherently difficult task.
no code implementations • 2 Jul 2019 • Dong Nguyen, Maria Liakata, Simon DeDeo, Jacob Eisenstein, David Mimno, Rebekah Tromble, Jane Winters
Second, we hope to provide a set of best practices for working with thick social and cultural concepts.
1 code implementation • ACL 2019 • Nicole Peinelt, Maria Liakata, Dong Nguyen
Existing datasets for scoring text pairs in terms of semantic similarity contain instances whose resolution differs according to the degree of difficulty.
no code implementations • NAACL 2018 • Dong Nguyen
Text classification models are becoming increasingly complex and opaque, however for many applications it is essential that the models are interpretable.
no code implementations • 22 Dec 2017 • Dong Nguyen, Barbara McGillivray, Taha Yasseri
On the one hand, the promise of the "wisdom of the crowd" has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages.
1 code implementation • CL 2017 • Dong Nguyen, Jacob Eisenstein
Quantifying the degree of spatial dependence for linguistic variables is a key task for analyzing dialectal variation.
no code implementations • 30 Aug 2015 • Dong Nguyen, A. Seza Doğruöz, Carolyn P. Rosé, Franciska de Jong
Language is a social phenomenon and variation is inherent to its social nature.