Search Results for author: Anirudh Srinivasan

Found 13 papers, 5 papers with code

BERTologiCoMix: How does Code-Mixing interact with Multilingual BERT?

no code implementations • EACL (AdaptNLP) 2021 • Sebastin Santy, Anirudh Srinivasan, Monojit Choudhury

Models such as mBERT and XLMR have shown success in solving Code-Mixed NLP tasks even though they were not exposed to such text during pretraining.

Paper
Add Code

Counterfactually Probing Language Identity in Multilingual Models

1 code implementation • 29 Oct 2023 • Anirudh Srinivasan, Venkata S Govindarajan, Kyle Mahowald

We use one such technique, AlterRep, a method of counterfactual probing, to explore the internal structure of multilingual models (mBERT and XLM-R).

counterfactual Masked Language Modeling +1

Paper
Code

Textless Low-Resource Speech-to-Speech Translation With Unit Language Models

1 code implementation • 24 May 2023 • Anuj Diwan, Anirudh Srinivasan, David Harwath, Eunsol Choi

We train and evaluate our models for English-to-German, German-to-English and Marathi-to-English translation on three different domains (European Parliament, Common Voice, and All India Radio) with single-speaker synthesized speech data.

Automatic Speech Recognition Denoising +6

Paper
Code

TyDiP: A Dataset for Politeness Classification in Nine Typologically Diverse Languages

1 code implementation • 29 Nov 2022 • Anirudh Srinivasan, Eunsol Choi

We study politeness phenomena in nine typologically diverse languages.

Paper
Code

CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

no code implementations • 19 Feb 2022 • Shuguang Chen, Gustavo Aguilar, Anirudh Srinivasan, Mona Diab, Thamar Solorio

For the unsupervised setting, we provide the following language pairs: English and Spanish-English (Eng-Spanglish), and English and Modern Standard Arabic-Egyptian Arabic (Eng-MSAEA) in both directions.

Language Identification Machine Translation +3

Paper
Add Code

Predicting the Performance of Multilingual NLP Models

no code implementations • 17 Oct 2021 • Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury

Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages.

Multilingual NLP

Paper
Add Code

GCM: A Toolkit for Generating Synthetic Code-mixed Text

1 code implementation • EACL 2021 • Mohd Sanad Zaki Rizvi, Anirudh Srinivasan, Tanuja Ganu, Monojit Choudhury, Sunayana Sitaram

Code-mixing is common in multilingual communities around the world, and processing it is challenging due to the lack of labeled and unlabeled data.

Paper
Code

MSR India at SemEval-2020 Task 9: Multilingual Models Can Do Code-Mixing Too

no code implementations • SEMEVAL 2020 • Anirudh Srinivasan

In this paper, we present our system for the SemEval 2020 task on code-mixed sentiment analysis.

Sentiment Analysis

Paper
Add Code

GLUECoS: An Evaluation Benchmark for Code-Switched NLP

no code implementations • ACL 2020 • Simran Khanuja, D, S apat, ipan, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury

We present results on all these tasks using cross-lingual word embedding models and multilingual models.

Language Identification named-entity-recognition +7

Paper
Add Code

Code-mixed parse trees and how to find them

no code implementations • LREC 2020 • Anirudh Srinivasan, D, S apat, ipan, Monojit Choudhury

In this paper, we explore the methods of obtaining parse trees of code-mixed sentences and analyse the obtained trees.

Paper
Add Code

GLUECoS : An Evaluation Benchmark for Code-Switched NLP

no code implementations • 26 Apr 2020 • Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury

We present results on all these tasks using cross-lingual word embedding models and multilingual models.

Language Identification named-entity-recognition +7

Paper
Add Code

Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities

no code implementations • ICON 2019 • Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali

In this paper, we examine and analyze the challenges associated with developing and introducing language technologies to low-resource language communities.

Paper
Add Code

Automated curriculum generation for Policy Gradients from Demonstrations

1 code implementation • 1 Dec 2019 • Anirudh Srinivasan, Dzmitry Bahdanau, Maxime Chevalier-Boisvert, Yoshua Bengio

In this paper, we present a technique that improves the process of training an agent (using RL) for instruction following.

Instruction Following

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.