Search Results for author: Thien Huu Nguyen

Found 81 papers, 16 papers with code

Hierarchical Graph Convolutional Networks for Jointly Resolving Cross-document Coreference of Entity and Event Mentions

no code implementations • NAACL (TextGraphs) 2021 • Duy Phung, Tuan Ngo Nguyen, Thien Huu Nguyen

Prior work has demonstrated the benefits of the predicate-argument information and document context for resolving the coreference of event mentions.

coreference-resolution Event Coreference Resolution +1

Paper
Add Code

Unsupervised Domain Adaptation for Text Classification via Meta Self-Paced Learning

no code implementations • COLING 2022 • Nghia Ngo Trung, Linh Ngo Van, Thien Huu Nguyen

A shift in data distribution can have a significant impact on performance of a text classification model.

Meta-Learning text-classification +2

Paper
Add Code

Parameter-Efficient Domain Knowledge Integration from Multiple Sources for Biomedical Pre-trained Language Models

no code implementations • Findings (EMNLP) 2021 • Qiuhao Lu, Dejing Dou, Thien Huu Nguyen

These knowledge adapters are pre-trained for individual domain knowledge sources and integrated via an attention-based knowledge controller to enrich PLMs.

Self-Supervised Learning

Paper
Add Code

Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport

no code implementations • EMNLP (MRL) 2021 • Duy Phung, Hieu Minh Tran, Minh Van Nguyen, Thien Huu Nguyen

We study a new problem of cross-lingual transfer learning for event coreference resolution (ECR) where models trained on data from a source language are adapted for evaluations in different target languages.

coreference-resolution Cross-Lingual Transfer +4

Paper
Add Code

MECI: A Multilingual Dataset for Event Causality Identification

1 code implementation • COLING 2022 • Viet Dac Lai, Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen

Our dataset thus enable a new research direction on cross-lingual transfer learning for ECI.

Cross-Lingual Transfer Event Causality Identification +1

Paper
Code

Does It Happen? Multi-hop Path Structures for Event Factuality Prediction with Graph Transformer Networks

no code implementations • WNUT (ACL) 2021 • Duong Le, Thien Huu Nguyen

In this work, we show that the multi-hop paths between the words are also necessary to compute the sentence structures for EFP.

Representation Learning Sentence

Paper
Add Code

Fine-grained Temporal Relation Extraction with Ordered-Neuron LSTM and Graph Convolutional Networks

no code implementations • WNUT (ACL) 2021 • Minh Tran Phu, Minh Van Nguyen, Thien Huu Nguyen

In this work, we propose to fill this gap by introducing novel methods to integrate the syntactic structures into the deep learning models for FineTempRel.

Relation Representation Learning +2

Paper
Add Code

Event Extraction in Video Transcripts

no code implementations • COLING 2022 • Amir Pouran Ben Veyseh, Viet Dac Lai, Franck Dernoncourt, Thien Huu Nguyen

As such, the challenges of EE in informal and noisy texts are not adequately studied.

Event Extraction Retrieval +1

Paper
Add Code

Introducing a New Dataset for Event Detection in Cybersecurity Texts

no code implementations • EMNLP 2020 • Hieu Man Duc Trong, Duc Trong Le, Amir Pouran Ben Veyseh, Thuat Nguyen, Thien Huu Nguyen

Detecting cybersecurity events is necessary to keep us informed about the fast growing number of such events reported in text.

Event Detection

Paper
Add Code

Event Detection: Gate Diversity and Syntactic Importance Scores for Graph Convolution Neural Networks

no code implementations • EMNLP 2020 • Viet Dac Lai, Tuan Ngo Nguyen, Thien Huu Nguyen

Recent studies on event detection (ED) have shown that the syntactic dependency graph can be employed in graph convolution neural networks (GCN) to achieve state-of-the-art performance.

Event Detection

Paper
Add Code

Event Extraction from Historical Texts: A New Dataset for Black Rebellions

no code implementations • Findings (ACL) 2021 • Viet Lai, Minh Van Nguyen, Heidi Kaufman, Thien Huu Nguyen

Event Extraction

Paper
Add Code

Unsupervised Domain Adaptation for Event Detection using Domain-specific Adapters

no code implementations • Findings (ACL) 2021 • Nghia Ngo Trung, Duy Phung, Thien Huu Nguyen

Event Detection Unsupervised Domain Adaptation

Paper
Add Code

Learning Prototype Representations Across Few-Shot Tasks for Event Detection

1 code implementation • EMNLP 2021 • Viet Lai, Franck Dernoncourt, Thien Huu Nguyen

We address the sampling bias and outlier issues in few-shot learning for event detection, a subtask of information extraction.

Event Detection Few-Shot Learning

Paper
Code

Modeling Document-Level Context for Event Detection via Important Context Selection

no code implementations • EMNLP 2021 • Amir Pouran Ben Veyseh, Minh Van Nguyen, Nghia Ngo Trung, Bonan Min, Thien Huu Nguyen

To address this issue, we propose a novel method to model document-level context for ED that dynamically selects relevant sentences in the document for the event prediction of the target sentence.

Event Detection Representation Learning +2

Paper
Add Code

Crosslingual Transfer Learning for Relation and Event Extraction via Word Category and Class Alignments

no code implementations • EMNLP 2021 • Minh Van Nguyen, Tuan Ngo Nguyen, Bonan Min, Thien Huu Nguyen

To address this issue, we propose a novel crosslingual alignment method that leverages class information of REE tasks for representation learning.

Event Extraction Relation +2

Paper
Add Code

Improving Cross-Lingual Transfer for Event Argument Extraction with Language-Universal Sentence Structures

no code implementations • EACL (WANLP) 2021 • Minh Van Nguyen, Thien Huu Nguyen

Previous work on CEAE has shown the cross-lingual benefits of universal dependency trees in capturing shared syntactic structures of sentences across languages.

Cross-Lingual Transfer Event Argument Extraction +4

Paper
Add Code

Keyphrase Prediction from Video Transcripts: New Dataset and Directions

no code implementations • COLING 2022 • Amir Pouran Ben Veyseh, Quan Hung Tran, Seunghyun Yoon, Varun Manjunatha, Hanieh Deilamsalehy, Rajiv Jain, Trung Bui, Walter W. Chang, Franck Dernoncourt, Thien Huu Nguyen

To this end, this work studies new challenges of KP in transcripts of videos, an understudied domain for KP that involves informal texts and non-cohesive presentation styles.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

no code implementations • 17 Sep 2023 • Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

However, when it comes to training datasets for these LLMs, especially the recent state-of-the-art models, they are often not fully disclosed.

Hallucination Language Identification

Paper
Add Code

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

2 code implementations • 29 Jul 2023 • Viet Dac Lai, Chien Van Nguyen, Nghia Trung Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

Okapi introduces instruction and response-ranked data in 26 diverse languages to facilitate the experiments and development of future multilingual LLM research.

Paper
Code

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

no code implementations • 24 Jul 2023 • Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen

Punctuation restoration is an important task in automatic speech recognition (ASR) which aim to restore the syntactic structure of generated ASR texts to improve readability.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection

no code implementations • 3 Jun 2023 • Minh Van Nguyen, Kishan Kc, Toan Nguyen, Thien Huu Nguyen, Ankit Chadha, Thuy Vu

In this paper, we propose to improve the candidate scoring by explicitly incorporating the dependencies between question-context and answer-context into the final representation of a candidate.

Open-Domain Question Answering Sentence

Paper
Add Code

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning

no code implementations • 12 Apr 2023 • Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen

The answer to this question requires a thorough evaluation of ChatGPT over multiple tasks with diverse languages and large datasets (i. e., beyond reported anecdotes), which is still missing or limited in current research.

Multilingual NLP Text Generation +1

Paper
Add Code

Textual Data Augmentation for Patient Outcomes Prediction

no code implementations • 13 Nov 2022 • Qiuhao Lu, Dejing Dou, Thien Huu Nguyen

Deep learning models have demonstrated superior performance in various healthcare applications.

Data Augmentation Language Modelling

Paper
Add Code

MEE: A Novel Multilingual Event Extraction Dataset

no code implementations • 11 Nov 2022 • Amir Pouran Ben Veyseh, Javid Ebrahimi, Franck Dernoncourt, Thien Huu Nguyen

Event Extraction (EE) is one of the fundamental tasks in Information Extraction (IE) that aims to recognize event mentions and their arguments (i. e., participants) from text.

Event Extraction

Paper
Add Code

MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection

no code implementations • NAACL 2022 • Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen

Event Detection (ED) is the task of identifying and classifying trigger words of event mentions in text.

Event Detection

Paper
Add Code

Improving Keyphrase Extraction with Data Augmentation and Information Filtering

no code implementations • 11 Sep 2022 • Amir Pouran Ben Veyseh, Nicole Meister, Franck Dernoncourt, Thien Huu Nguyen

Keyphrase extraction is one of the essential tasks for document understanding in NLP.

Data Augmentation document understanding +1

Paper
Add Code

Tutorial Recommendation for Livestream Videos using Discourse-Level Consistency and Ontology-Based Filtering

no code implementations • 11 Sep 2022 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

In order to alleviate this issue, one solution is to link the streaming videos with the relevant tutorial available for the tools used in the streaming video.

Paper
Add Code

Symlink: A New Dataset for Scientific Symbol-Description Linking

no code implementations • 26 Apr 2022 • Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

Mathematical symbols and descriptions appear in various forms across document section boundaries without explicit markup.

Paper
Add Code

MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction

no code implementations • COLING 2022 • Amir Pouran Ben Veyseh, Nicole Meister, Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen

Acronym extraction is the task of identifying acronyms and their expanded forms in texts that is necessary for various NLP applications.

Paper
Add Code

SemEval 2022 Task 12: Symlink- Linking Mathematical Symbols to their Descriptions

no code implementations • 19 Feb 2022 • Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

Given the increasing number of livestreaming videos, automatic speech recognition and post-processing for livestreaming video transcripts are crucial for efficient data management as well as knowledge mining.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction

1 code implementation • NAACL (ACL) 2022 • Minh Van Nguyen, Nghia Trung Ngo, Bonan Min, Thien Huu Nguyen

FAMIE is designed to address a fundamental problem in existing AL frameworks where annotators need to wait for a long time between annotation batches due to the time-consuming nature of model training and data selection at each AL iteration.

Active Learning Knowledge Distillation

Paper
Code

Predicting Patient Readmission Risk from Medical Text via Knowledge Graph Enhanced Multiview Graph Convolution

no code implementations • 19 Dec 2021 • Qiuhao Lu, Thien Huu Nguyen, Dejing Dou

Unplanned intensive care unit (ICU) readmission rate is an important metric for evaluating the quality of hospital care.

Representation Learning Time Series +1

Paper
Add Code

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

no code implementations • 1 Nov 2021 • Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, Dan Roth

Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field.

Text Generation

Paper
Add Code

Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution

no code implementations • ACL 2021 • Hieu Minh Tran, Duy Phung, Thien Huu Nguyen

In addition, consistency constraints between golden and predicted clusters of event mentions have not been considered to improve representation learning in prior deep learning models for ECR.

coreference-resolution Event Coreference Resolution +1

Paper
Add Code

Unleash GPT-2 Power for Event Detection

no code implementations • ACL 2021 • Amir Pouran Ben Veyseh, Viet Lai, Franck Dernoncourt, Thien Huu Nguyen

To prevent the noises inevitable in automatically generated data from hampering training process, we propose to exploit a teacher-student architecture in which the teacher is supposed to learn anchor knowledge from the original data.

Event Detection Language Modelling

Paper
Add Code

DPR at SemEval-2021 Task 8: Dynamic Path Reasoning for Measurement Relation Extraction

no code implementations • SEMEVAL 2021 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

To this end, in this paper, we propose a novel model for the task of measurement relation extraction (MRE) whose goal is to recognize the relation between measured entities, quantities, and conditions mentioned in a document.

Relation Relation Extraction +1

Paper
Add Code

Dictionary-Guided Scene Text Recognition

1 code implementation • CVPR 2021 • Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai

Language prior plays an important role in the way humans perceive and recognize text in the wild.

Scene Text Detection Scene Text Recognition +2

128

Paper
Code

Graph Convolutional Networks for Event Causality Identification with Rich Document-level Structures

no code implementations • NAACL 2021 • Minh Tran Phu, Thien Huu Nguyen

Although deep learning models have recently shown state-of-the-art performance for ECI, they are limited to the intra-sentence setting where event mention pairs are presented in the same sentences.

Event Causality Identification Sentence

Paper
Add Code

Fine-Grained Event Trigger Detection

no code implementations • EACL 2021 • Duong Le, Thien Huu Nguyen

Most of the previous work on Event Detection (ED) has only considered the datasets with a small number of event types (i. e., up to 38 types).

Event Detection Word Sense Disambiguation

Paper
Add Code

Cross-Task Instance Representation Interactions and Label Dependencies for Joint Information Extraction with Graph Convolutional Networks

no code implementations • NAACL 2021 • Minh Van Nguyen, Viet Dac Lai, Thien Huu Nguyen

Existing works on information extraction (IE) have mainly solved the four main tasks separately (entity mention recognition, relation extraction, event trigger detection, and argument extraction), thus failing to benefit from inter-dependencies between tasks.

Relation Extraction Representation Learning +1

Paper
Add Code

MadDog: A Web-based System for Acronym Identification and Disambiguation

1 code implementation • EACL 2021 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Walter Chang, Thien Huu Nguyen

However, none of the existing works provide a unified solution capable of processing acronyms in various domains and to be publicly available.

Paper
Code

Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing

1 code implementation • EACL 2021 • Minh Van Nguyen, Viet Dac Lai, Amir Pouran Ben Veyseh, Thien Huu Nguyen

Finally, we create a demo video for Trankit at: https://youtu. be/q0KGP3zGjGc.

Ranked #1 on Sentence segmentation on UD2.5 test

Dependency Parsing Language Modelling +8

712

Paper
Code

Acronym Identification and Disambiguation Shared Tasks for Scientific Document Understanding

no code implementations • 22 Dec 2020 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen, Walter Chang, Leo Anthony Celi

To push forward research in this direction, we have organized two shared task for acronym identification and acronym disambiguation in scientific documents, named AI@SDU and AD@SDU, respectively.

document understanding

Paper
Add Code

Exploiting Node Content for Multiview Graph Convolutional Network and Adversarial Regularization

1 code implementation • COLING 2020 • Qiuhao Lu, Nisansa de Silva, Dejing Dou, Thien Huu Nguyen, Prithviraj Sen, Berthold Reinwald, Yunyao Li

Network representation learning (NRL) is crucial in the area of graph learning.

Graph Learning Link Prediction +3

Paper
Code

Structural and Functional Decomposition for Personality Image Captioning in a Communication Game

no code implementations • Findings of the Association for Computational Linguistics 2020 • Thu Nguyen, Duy Phung, Minh Hoai, Thien Huu Nguyen

Personality image captioning (PIC) aims to describe an image with a natural language caption given a personality trait.

Caption Generation Image Captioning +1

Paper
Add Code

The Dots Have Their Values: Exploiting the Node-Edge Connections in Graph-based Neural Models for Document-level Relation Extraction

no code implementations • Findings of the Association for Computational Linguistics 2020 • Hieu Minh Tran, Minh Trung Nguyen, Thien Huu Nguyen

However, this model does not capture the representations for the nodes in the graphs, thus preventing it from effectively encoding the specific and relevant information of the nodes for DRE.

Document-level Relation Extraction Representation Learning +1

Paper
Add Code

What Does This Acronym Mean? Introducing a New Dataset for Acronym Identification and Disambiguation

2 code implementations • COLING 2020 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Quan Hung Tran, Thien Huu Nguyen

The proposed model outperforms the state-of-the-art models on the new AD dataset, providing a strong baseline for future research on this dataset.

Sentence

Paper
Code

Event Detection: Gate Diversity and Syntactic Importance Scoresfor Graph Convolution Neural Networks

no code implementations • 27 Oct 2020 • Viet Dac Lai, Tuan Ngo Nguyen, Thien Huu Nguyen

Recent studies on event detection (ED) haveshown that the syntactic dependency graph canbe employed in graph convolution neural net-works (GCN) to achieve state-of-the-art per-formance.

Event Detection

Paper
Add Code

Improving Aspect-based Sentiment Analysis with Gated Graph Convolutional Networks and Syntax-based Regulation

no code implementations • Findings of the Association for Computational Linguistics 2020 • Amir Pouran Ben Veyseh, Nasim Nour, Franck Dernoncourt, Quan Hung Tran, Dejing Dou, Thien Huu Nguyen

In addition, we propose a mechanism to obtain the importance scores for each word in the sentences based on the dependency trees that are then injected into the model to improve the representation vectors for ABSA.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Add Code

Graph Transformer Networks with Syntactic and Semantic Structures for Event Argument Extraction

no code implementations • Findings of the Association for Computational Linguistics 2020 • Amir Pouran Ben Veyseh, Tuan Ngo Nguyen, Thien Huu Nguyen

The goal of Event Argument Extraction (EAE) is to find the role of each entity mention for a given event trigger word.

Event Argument Extraction Inductive Bias +1

Paper
Add Code

Introducing Syntactic Structures into Target Opinion Word Extraction with Deep Learning

no code implementations • EMNLP 2020 • Amir Pouran Ben Veyseh, Nasim Nouri, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

In this work, we propose to incorporate the syntactic structures of the sentences into the deep learning models for TOWE, leveraging the syntax-based opinion possibility scores and the syntactic connections between the words.

Ranked #3 on Aspect-oriented Opinion Extraction on SemEval-2014 Task-4

Aspect-Based Sentiment Analysis Aspect-oriented Opinion Extraction +1

Paper
Add Code

Exploiting the Syntax-Model Consistency for Neural Relation Extraction

no code implementations • ACL 2020 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

In order to overcome these issues, we propose a novel deep learning model for RE that uses the dependency trees to extract the syntax-based importance scores for the words, serving as a tree representation to introduce syntactic information into the models with greater generalization.

Multi-Task Learning Relation +1

Paper
Add Code

Extensively Matching for Few-shot Learning Event Detection

1 code implementation • WS 2020 • Viet Dac Lai, Franck Dernoncourt, Thien Huu Nguyen

In this work, weformulate event detection as a few-shot learn-ing problem to enable to extend event detec-tion to new event types.

Event Detection Few-Shot Learning

Paper
Code

Exploiting the Matching Information in the Support Set for Few Shot Event Classification

no code implementations • 13 Feb 2020 • Viet Dac Lai, Franck Dernoncourt, Thien Huu Nguyen

The existing event classification (EC) work primarily focuseson the traditional supervised learning setting in which models are unableto extract event mentions of new/unseen event types.

Classification Few-Shot Learning +2

Paper
Add Code

A Joint Model for Definition Extraction with Syntactic Connection and Semantic Consistency

1 code implementation • 5 Nov 2019 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

In this work, we propose a novel model for DE that simultaneously performs the two tasks in a single framework to benefit from their inter-dependencies.

Definition Extraction Multi-Task Learning +2

Paper
Code

Improving Slot Filling by Utilizing Contextual Information

no code implementations • WS 2020 • Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

To address this issue, in this paper, we propose a novel method to incorporate the contextual information in two different levels, i. e., representation level and task-specific (i. e., label) level.

Ranked #5 on Intent Detection on SNIPS

Intent Detection slot-filling +2

Paper
Add Code

On the Effectiveness of the Pooling Methods for Biomedical Relation Extraction with Deep Learning

no code implementations • WS 2019 • Tuan Ngo Nguyen, Franck Dernoncourt, Thien Huu Nguyen

Deep learning models have achieved state-of-the-art performances on many relation extraction datasets.

Relation Relation Extraction

Paper
Add Code

Extending Event Detection to New Types with Learning from Keywords

no code implementations • WS 2019 • Viet Dac Lai, Thien Huu Nguyen

We introduce a novel feature-based attention mechanism for convolutional neural networks for event detection in the new formulation.

Event Detection Sentence

Paper
Add Code

Language-independent Cross-lingual Contextual Representations

no code implementations • 25 Sep 2019 • Xiao Zhang, Song Wang, Dejing Dou, Xien Liu, Thien Huu Nguyen, Ji Wu

Contextual representation models like BERT have achieved state-of-the-art performance on a diverse range of NLP tasks.

Transfer Learning Zero-Shot Cross-Lingual Transfer

Paper
Add Code

Graph based Neural Networks for Event Factuality Prediction using Syntactic and Semantic Structures

1 code implementation • ACL 2019 • Amir Pouran Ben Veyseh, Thien Huu Nguyen, Dejing Dou

In this work, we introduce a novel graph-based neural network for EFP that can integrate the semantic and syntactic information more effectively.

Sentence

Paper
Code

Improving Cross-Domain Performance for Relation Extraction via Dependency Prediction and Information Flow Control

no code implementations • 7 Jul 2019 • Amir Pouran Ben Veyseh, Thien Huu Nguyen, Dejing Dou

The current deep learning models for relation extraction has mainly exploited this dependency information by guiding their computation along the structures of the dependency trees.

Domain Generalization Relation +1

Paper
Add Code

Employing the Correspondence of Relations and Connectives to Identify Implicit Discourse Relations via Label Embeddings

no code implementations • ACL 2019 • Linh The Nguyen, Linh Van Ngo, Khoat Than, Thien Huu Nguyen

It has been shown that implicit connectives can be exploited to improve the performance of the models for implicit discourse relation recognition (IDRR).

Multi-Task Learning

Paper
Add Code

One for All: Neural Joint Modeling of Entities and Events

no code implementations • 1 Dec 2018 • Trung Minh Nguyen, Thien Huu Nguyen

The previous work for event extraction has mainly focused on the predictions for event triggers and argument roles, treating entity mentions as being provided by human annotators.

Event Extraction

Paper
Add Code

Systematic Generalization: What Is Required and Can It Be Learned?

2 code implementations • ICLR 2019 • Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

Numerous models for grounded language understanding have been recently proposed, including (i) generic models that can be easily adapted to any given task and (ii) intuitively appealing modular models that require background knowledge to be instantiated.

Systematic Generalization Visual Question Answering (VQA)

Paper
Code

A Case Study on Learning a Unified Encoder of Relations

no code implementations • WS 2018 • Lisheng Fu, Bonan Min, Thien Huu Nguyen, Ralph Grishman

Typical relation extraction models are trained on a single corpus annotated with a pre-defined relation schema.

Knowledge Base Population Relation +1

Paper
Add Code

BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning

6 code implementations • ICLR 2019 • Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, Thien Huu Nguyen, Yoshua Bengio

Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and scientific reasons, but given the poor data efficiency of the current learning methods, this goal may require substantial research efforts.

Grounded language learning

2,016

Paper
Code

Similar but not the Same: Word Sense Disambiguation Improves Event Detection via Neural Representation Matching

no code implementations • EMNLP 2018 • Weiyi Lu, Thien Huu Nguyen

Event detection (ED) and word sense disambiguation (WSD) are two similar tasks in that they both involve identifying the classes (i. e. event types or word senses) of some word in a given sentence.

Event Detection Sentence +1

Paper
Add Code

Who is Killed by Police: Introducing Supervised Attention for Hierarchical LSTMs

no code implementations • COLING 2018 • Minh Nguyen, Thien Huu Nguyen

The early work in this field \cite{keith2017identifying} proposed a distant supervision framework based on Expectation Maximization (EM) to deal with the multiple appearances of the names in documents.

Paper
Add Code

A Deep Learning Model with Hierarchical LSTMs and Supervised Attention for Anti-Phishing

no code implementations • 3 May 2018 • Minh Nguyen, Toan Nguyen, Thien Huu Nguyen

Anti-phishing aims to detect phishing content/documents in a pool of textual data.

Sentence Text Categorization

Paper
Add Code

Domain Adaptation for Relation Extraction with Domain Adversarial Neural Network

no code implementations • IJCNLP 2017 • Lisheng Fu, Thien Huu Nguyen, Bonan Min, Ralph Grishman

Our method is a joint model consisting of a CNN-based relation classifier and a domain-adversarial classifier.

Relation Relation Extraction +1

Paper
Add Code

Joint Learning of Local and Global Features for Entity Linking via Neural Networks

no code implementations • COLING 2016 • Thien Huu Nguyen, Nicolas Fauceglia, Mariano Rodriguez Muro, Oktie Hassanzadeh, Alfio Massimiliano Gliozzo, Mohammad Sadoghi

Previous studies have highlighted the necessity for entity linking systems to capture the local entity-mention similarities and the global topical coherence.

Domain Adaptation Entity Linking

Paper
Add Code

Modeling Skip-Grams for Event Detection with Convolutional Neural Networks

no code implementations • EMNLP 2016 • Thien Huu Nguyen, Ralph Grishman

Domain Adaptation Event Detection +1

Paper
Add Code

A Two-stage Approach for Extending Event Detection to New Types via Neural Networks

no code implementations • WS 2016 • Thien Huu Nguyen, Lisheng Fu, Kyunghyun Cho, Ralph Grishman

Domain Adaptation Event Detection +2

Paper
Add Code

Joint Event Extraction via Recurrent Neural Networks

1 code implementation • NAACL 2016 • Thien Huu Nguyen, Kyunghyun Cho, Ralph Grishman

Event Extraction Structured Prediction

Paper
Code

Toward Mention Detection Robustness with Recurrent Neural Networks

no code implementations • 24 Feb 2016 • Thien Huu Nguyen, Avirup Sil, Georgiana Dinu, Radu Florian

One of the key challenges in natural language processing (NLP) is to yield good performance across application domains and languages.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Combining Neural Networks and Log-linear Models to Improve Relation Extraction

no code implementations • 18 Nov 2015 • Thien Huu Nguyen, Ralph Grishman

The last decade has witnessed the success of the traditional feature-based method on exploiting the discrete structures such as words or lexical patterns to extract relations from text.

Ranked #1 on Relation Extraction on ACE 2005 (Cross Sentence metric)