Search Results for author: Dong Nguyen

Using this new framework, we design a Transformer-based user model that can produce high-quality general-purpose user representations for instant messaging platforms like Snapchat.

Representation Learning

Paper
Add Code

FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics

1 code implementation • 10 Oct 2023 • Yupei Du, Albert Gatt, Dong Nguyen

Dataset cartography is a simple yet effective dual-model approach that improves the robustness of fine-tuned PLMs.

Paper
Code

Epicurus at SemEval-2023 Task 4: Improving Prediction of Human Values behind Arguments by Leveraging Their Definitions

1 code implementation • 27 Feb 2023 • Christian Fang, Qixiang Fang, Dong Nguyen

We describe our experiments for SemEval-2023 Task 4 on the identification of human values behind arguments (ValueEval).

Paper
Code

Measuring the Instability of Fine-Tuning

1 code implementation • 15 Feb 2023 • Yupei Du, Dong Nguyen

Fine-tuning pre-trained language models on downstream tasks with varying random seeds has been shown to be unstable, especially on small datasets.

Paper
Code

Template-based Abstractive Microblog Opinion Summarisation

no code implementations • 8 Aug 2022 • Iman Munire Bilal, Bo wang, Adam Tsakalidis, Dong Nguyen, Rob Procter, Maria Liakata

We introduce the task of microblog opinion summarisation (MOS) and share a dataset of 3100 gold-standard opinion summaries to facilitate research in this domain.

Paper
Add Code

Same Author or Just Same Topic? Towards Content-Independent Style Representations

1 code implementation • RepL4NLP (ACL) 2022 • Anna Wegmann, Marijn Schraagen, Dong Nguyen

Linguistic style is an integral component of language.

Authorship Verification

Paper
Code

Evaluating the Construct Validity of Text Embeddings with Application to Survey Questions

1 code implementation • 18 Feb 2022 • Qixiang Fang, Dong Nguyen, Daniel L Oberski

Our results thus highlight the necessity to examine the construct validity of text embeddings before deploying them in social science research.

Sentence valid

Paper
Code

Understanding Public Opinion on Using Hydroxychloroquine for COVID-19 Treatment via Social Media

2 code implementations • 1 Jan 2022 • Thuy T. Do, Du Nguyen, Anh Le, Anh Nguyen, Dong Nguyen, Nga Hoang, Uyen Le, Tuan Tran

This paper studies the reactions of social network users on the recommendation of using HCQ for COVID-19 treatment by analyzing the reaction patterns and sentiment of the tweets.

Descriptive Sentiment Analysis

Paper
Code

Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework

1 code implementation • EMNLP 2021 • Anna Wegmann, Dong Nguyen

Style is an integral part of natural language.

Paper
Code

Assessing the Reliability of Word Embedding Gender Bias Measures

1 code implementation • EMNLP 2021 • Yupei Du, Qixiang Fang, Dong Nguyen

In this paper, we assess three types of reliability of word embedding gender bias measures, namely test-retest reliability, inter-rater consistency and internal consistency.

Word Embeddings

Paper
Code

On learning and representing social meaning in NLP: a sociolinguistic perspective

no code implementations • NAACL 2021 • Dong Nguyen, Laura Rosseel, Jack Grieve

The field of NLP has made substantial progress in building meaning representations.

Representation Learning

Paper
Add Code

Introducing CAD: the Contextual Abuse Dataset

1 code implementation • NAACL 2021 • Bertie Vidgen, Dong Nguyen, Helen Margetts, Patricia Rossini, Rebekah Tromble

Online abuse can inflict harm on users and communities, making online spaces unsafe and toxic.

Paper
Code

Semantic Journeys: Quantifying Change in Emoji Meaning from 2012-2018

1 code implementation • 3 May 2021 • Alexander Robertson, Farhana Ferdousi Liza, Dong Nguyen, Barbara McGillivray, Scott A. Hale

The semantics of emoji has, to date, been considered from a static perspective.

Paper
Code

HateCheck: Functional Tests for Hate Speech Detection Models

3 code implementations • ACL 2021 • Paul Röttger, Bertram Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts, Janet B. Pierrehumbert

Detecting online hate is a difficult task that even state-of-the-art models struggle with.

Hate Speech Detection

Paper
Code

Do Word Embeddings Capture Spelling Variation?

1 code implementation • COLING 2020 • Dong Nguyen, Jack Grieve

Analyses of word embeddings have primarily focused on semantic and syntactic properties.

Word Embeddings

Paper
Code

Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

no code implementations • 22 Jul 2020 • Nicole Peinelt, Dong Nguyen, Maria Liakata

Question paraphrase identification is a key task in Community Question Answering (CQA) to determine if an incoming question has been previously asked.

Community Question Answering Paraphrase Identification +2

Paper
Add Code

tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection

1 code implementation • ACL 2020 • Nicole Peinelt, Dong Nguyen, Maria Liakata

Semantic similarity detection is a fundamental task in natural language understanding.

Natural Language Understanding Semantic Similarity +2

140

Paper
Code

Room to Glo: A Systematic Comparison of Semantic Change Detection Approaches with Word Embeddings

no code implementations • IJCNLP 2019 • Philippa Shoemark, Farhana Ferdousi Liza, Dong Nguyen, Scott Hale, Barbara McGillivray

Word embeddings are increasingly used for the automatic detection of semantic change; yet, a robust evaluation and systematic comparison of the choices involved has been lacking.

Change Detection Time Series +2

Paper
Add Code

Challenges and frontiers in abusive content detection

1 code implementation • WS 2019 • Bertie Vidgen, Alex Harris, Dong Nguyen, Rebekah Tromble, Scott Hale, Helen Margetts

Online abusive content detection is an inherently difficult task.

Abuse Detection

Paper
Code

How we do things with words: Analyzing text as social and cultural data

no code implementations • 2 Jul 2019 • Dong Nguyen, Maria Liakata, Simon DeDeo, Jacob Eisenstein, David Mimno, Rebekah Tromble, Jane Winters

Second, we hope to provide a set of best practices for working with thick social and cultural concepts.

Paper
Add Code

Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets

1 code implementation • ACL 2019 • Nicole Peinelt, Maria Liakata, Dong Nguyen

Existing datasets for scoring text pairs in terms of semantic similarity contain instances whose resolution differs according to the degree of difficulty.

Paper
Code

Comparing Automatic and Human Evaluation of Local Explanations for Text Classification

no code implementations • NAACL 2018 • Dong Nguyen

Text classification models are becoming increasingly complex and opaque, however for many applications it is essential that the models are interpretable.

General Classification Recommendation Systems +2

Paper
Add Code

Emo, Love, and God: Making Sense of Urban Dictionary, a Crowd-Sourced Online Dictionary

no code implementations • 22 Dec 2017 • Dong Nguyen, Barbara McGillivray, Taha Yasseri

On the one hand, the promise of the "wisdom of the crowd" has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages.