no code implementations • 6 Apr 2024 • Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya
How does the importance of positional encoding in pre-trained language models (PLMs) vary across languages with different morphological complexity?
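For readers unfamiliar with the term, positional encoding injects token-order information into a Transformer's otherwise order-agnostic input. Below is a minimal sketch of the standard sinusoidal encoding from the original Transformer (Vaswani et al., 2017); it illustrates the mechanism the paper studies, not the paper's own analysis.

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """Standard sinusoidal positional encoding (Vaswani et al., 2017)."""
    positions = np.arange(seq_len)[:, None]           # (seq_len, 1)
    dims = np.arange(d_model)[None, :]                # (1, d_model)
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates                  # (seq_len, d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])             # even dimensions: sine
    pe[:, 1::2] = np.cos(angles[:, 1::2])             # odd dimensions: cosine
    return pe
```

Each position receives a unique pattern of sines and cosines, which lets the model recover relative token order from the embeddings alone.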
1 code implementation • 4 Jan 2024 • Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar
Foundational models with billions of parameters, trained on large corpora of data, have demonstrated non-trivial skills in a variety of domains.
no code implementations • 2 Nov 2023 • Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar
Once the pre-training corpus has been assembled, all of its data samples are treated with equal importance during LM pre-training.
no code implementations • 19 Sep 2023 • Shikhar Bharadwaj, Min Ma, Shikhar Vashishth, Ankur Bapna, Sriram Ganapathy, Vera Axelrod, Siddharth Dalmia, Wei Han, Yu Zhang, Daan van Esch, Sandy Ritchie, Partha Talukdar, Jason Riesa
Spoken language identification refers to the task of automatically predicting the spoken language in a given utterance.
no code implementations • 20 Jul 2023 • Anjali Raj, Shikhar Bharadwaj, Sriram Ganapathy, Min Ma, Shikhar Vashishth
In recent years, speech representation learning has been framed primarily as a self-supervised learning (SSL) task that uses the raw audio signal alone, ignoring the side information that is often available for a given speech recording.
no code implementations • 7 Jun 2023 • Shikhar Vashishth, Shikhar Bharadwaj, Sriram Ganapathy, Ankur Bapna, Min Ma, Wei Han, Vera Axelrod, Partha Talukdar
In this paper, we propose a novel framework that combines self-supervised representation learning with language label information during pre-training.
no code implementations • 15 Dec 2021 • Sheng Zhang, Hao Cheng, Shikhar Vashishth, Cliff Wong, Jinfeng Xiao, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon
Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example gold entity mentions during training and canonical descriptions for all entities, both of which are rarely available outside of Wikipedia.
1 code implementation • ACL 2021 • Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman, Carolyn Penstein Rosé
We develop a deep convolutional network that utilizes textual entity representations and demonstrate that our model outperforms recent KG completion methods in this challenging setting.
2 code implementations • ICLR 2021 • Rishabh Joshi, Vidhisha Balachandran, Shikhar Vashishth, Alan Black, Yulia Tsvetkov
To successfully negotiate a deal, it is not enough to communicate fluently: pragmatic planning of persuasive negotiation strategies is essential.
1 code implementation • EMNLP 2020 • Sopan Khosla, Shikhar Vashishth, Jill Fain Lehman, Carolyn Rose
In this paper, we propose MedFilter, a novel modeling approach that leverages these insights to improve the identification and categorization of task-relevant utterances and, in doing so, improves performance on a downstream information extraction task.
1 code implementation • 1 May 2020 • Shikhar Vashishth, Denis Newman-Griffis, Rishabh Joshi, Ritam Dutt, Carolyn Rose
To address the dearth of annotated training data for medical entity linking, we present WikiMed and PubMedDS, two large-scale medical entity linking datasets, and demonstrate that pre-training MedType on these datasets further improves entity linking performance.
2 code implementations • ACL 2020 • Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar, Yiming Yang
Knowledge Graph Completion (KGC) aims at automatically predicting missing links in large-scale knowledge graphs (a generic scoring sketch follows this entry).
Ranked #25 on Link Prediction on FB15k-237 (MR metric)
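To make the task concrete, here is a sketch of how a KGC model scores and ranks candidate links; the TransE scoring function used here is a classic baseline chosen purely for illustration, not this paper's method. The mean of the ranks it produces over a test set is the MR (mean rank) metric cited in the leaderboard badge above.

```python
import numpy as np

def transe_score(h, r, t):
    """TransE plausibility score: h + r should land near t, so a
    smaller distance means a more plausible triple."""
    return -np.linalg.norm(h + r - t, ord=1)

def rank_tail(h, r, all_entities, true_tail_idx):
    """Rank the true tail among all candidates (basis of MR/MRR/Hits@k)."""
    scores = np.array([transe_score(h, r, e) for e in all_entities])
    order = np.argsort(-scores)                       # best score first
    return int(np.where(order == true_tail_idx)[0][0]) + 1  # 1-based rank
```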
2 code implementations • 8 Nov 2019 • Shikhar Vashishth
Knowledge graphs are structured, graph-based representations of facts, where nodes represent entities and edges represent the relationships between them, as in the toy example below.
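As a concrete illustration (a toy example, not anything from the thesis itself), a knowledge graph can be stored as a set of (head, relation, tail) triples:

```python
# A toy knowledge graph as a set of (head, relation, tail) triples.
kg = {
    ("Paris", "capital_of", "France"),
    ("France", "located_in", "Europe"),
    ("Berlin", "capital_of", "Germany"),
}

def facts_about(entity, kg):
    """Return every triple in which `entity` appears as head or tail."""
    return [t for t in kg if entity in (t[0], t[2])]

print(facts_about("France", kg))
# e.g. [('Paris', 'capital_of', 'France'), ('France', 'located_in', 'Europe')]
```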
4 code implementations • ICLR 2020 • Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Partha Talukdar
Multi-relational graphs are a more general and prevalent form of graph in which each edge has an associated label and direction (see the sketch after this entry).
Ranked #22 on Link Prediction on FB15k-237
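A simplified sketch of composition-based message passing over such a graph, in the spirit of the paper's CompGCN. Real implementations add self-loop relations, relation-embedding updates, and normalization, all omitted here; `W_in` and `W_out` are assumed to be square weight matrices.

```python
import numpy as np

def comp_mp_layer(node_emb, rel_emb, edges, W_in, W_out, compose=np.subtract):
    """One composition-based message-passing step on a multi-relational graph.

    edges: iterable of (head, relation, tail) index triples.
    compose: combines a neighbour embedding with the relation embedding
             (subtraction gives a TransE-style composition).
    """
    out = np.zeros_like(node_emb)
    for h, r, t in edges:
        out[t] += compose(node_emb[h], rel_emb[r]) @ W_in   # edge direction
        out[h] += compose(node_emb[t], rel_emb[r]) @ W_out  # inverse direction
    return np.tanh(out)
```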
1 code implementation • 1 Nov 2019 • Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Nilesh Agrawal, Partha Talukdar
In this paper, we analyze how increasing the number of these interactions affects link prediction performance, and utilize our observations to propose InteractE.
Ranked #11 on Link Prediction on YAGO3-10
2 code implementations • 24 Sep 2019 • Shikhar Vashishth, Shyam Upadhyay, Gaurav Singh Tomar, Manaal Faruqui
Neural network models are usually criticized for being opaque; the attention layer is claimed to provide insight into the reasoning behind a model's predictions (the sketch below shows where those attention weights come from).
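A minimal sketch of scaled dot-product attention that returns the weight matrix such interpretability analyses inspect; this is a generic implementation, not the paper's experimental setup.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Return attention outputs plus the weight matrix used for inspection."""
    d_k = Q.shape[-1]
    logits = Q @ K.T / np.sqrt(d_k)
    weights = np.exp(logits - logits.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over keys
    return weights @ V, weights                       # weights: (n_q, n_k)
```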
1 code implementation • 1 Feb 2019 • Shikhar Vashishth, Prince Jain, Partha Talukdar
Open Information Extraction (OpenIE) methods extract (noun phrase, relation phrase, noun phrase) triples from text, resulting in the construction of large Open Knowledge Bases (Open KBs); a toy canonicalization sketch follows this entry.
Ranked #1 on Noun Phrase Canonicalization on Ambiguous Dataset
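A toy stand-in for the canonicalization step, greedily clustering noun phrases whose embeddings are similar. Here `embed` is a hypothetical phrase-to-vector function, and the actual paper additionally exploits side information, so treat this only as an illustration of the task.

```python
import numpy as np

def canonicalize(noun_phrases, embed, threshold=0.8):
    """Greedily cluster noun phrases by cosine similarity of embeddings,
    so that NPs referring to one entity ("Barack Obama", "Obama")
    ideally land in one cluster. `embed` is an assumed helper."""
    clusters = []
    for phrase in noun_phrases:
        v = embed(phrase)
        for cluster in clusters:
            c = embed(cluster[0])                     # cluster representative
            cos = v @ c / (np.linalg.norm(v) * np.linalg.norm(c))
            if cos >= threshold:
                cluster.append(phrase)
                break
        else:
            clusters.append([phrase])                 # start a new cluster
    return clusters
```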
1 code implementation • ACL 2018 • Shikhar Vashishth, Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar
While existing approaches for these tasks assume accurate knowledge of the document date, this is not always available, especially for arbitrary documents from the Web.
Ranked #1 on Document Dating on APW
1 code implementation • 24 Jan 2019 • Shikhar Vashishth, Prateek Yadav, Manik Bhandari, Partha Talukdar
Graph-based Semi-Supervised Learning (SSL) methods address this problem by labeling a small subset of nodes as seeds and then using the graph structure to predict label scores for the remaining nodes (see the sketch below).
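For context, the classic label-propagation baseline for this seed-and-propagate setting looks roughly as follows; this is a minimal sketch of the generic technique, not the method proposed in the paper.

```python
import numpy as np

def label_propagation(adj, seed_labels, n_classes, n_iters=50):
    """Iteratively average neighbours' label scores, keeping seeds clamped.

    adj: (n, n) adjacency matrix; seed_labels: dict {node_index: class}.
    """
    n = adj.shape[0]
    scores = np.zeros((n, n_classes))
    for node, cls in seed_labels.items():
        scores[node, cls] = 1.0
    deg = adj.sum(axis=1, keepdims=True).clip(min=1.0)
    for _ in range(n_iters):
        scores = adj @ scores / deg                   # average over neighbours
        for node, cls in seed_labels.items():         # re-clamp seed nodes
            scores[node] = 0.0
            scores[node, cls] = 1.0
    return scores.argmax(axis=1)                      # predicted class per node
```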
1 code implementation • EMNLP 2018 • Shikhar Vashishth, Rishabh Joshi, Sai Suman Prayaga, Chiranjib Bhattacharyya, Partha Talukdar
In this paper, we propose RESIDE, a distantly-supervised neural relation extraction method which utilizes additional side information from KBs for improved relation extraction.
Ranked #5 on Relation Extraction on NYT Corpus
1 code implementation • ACL 2019 • Shikhar Vashishth, Manik Bhandari, Prateek Yadav, Piyush Rai, Chiranjib Bhattacharyya, Partha Talukdar
Word embeddings have been widely adopted across several NLP applications.
1 code implementation • 29 May 2018 • Prateek Yadav, Madhav Nimishakavi, Naganand Yadati, Shikhar Vashishth, Arun Rajkumar, Partha Talukdar
We analyse local and global properties of graphs and demonstrate settings where LCNs tend to work better than GCNs.
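For reference, a minimal sketch of the standard GCN propagation rule (Kipf & Welling, 2017) that the LCN comparison is framed against; the LCN construction itself is not reproduced here.

```python
import numpy as np

def gcn_layer(adj, H, W):
    """One GCN step: H' = ReLU(D^{-1/2} (A + I) D^{-1/2} H W)."""
    A_hat = adj + np.eye(adj.shape[0])                # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    A_norm = A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(A_norm @ H @ W, 0.0)            # ReLU activation
```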