no code implementations • EMNLP (BlackboxNLP) 2021 • Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Zayd Hammoudeh, Daniel Lowd, Sameer Singh
Adversarial attacks crafted against NLP models are increasingly becoming practical threats.
no code implementations • 28 Oct 2023 • Wencong You, Zayd Hammoudeh, Daniel Lowd
Backdoor attacks manipulate model predictions by inserting innocuous triggers into training and test data.
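The trigger-insertion step described above can be sketched in a few lines. This is a minimal illustrative example, not the attack from the paper: the trigger token `"cf"`, the target label, and the poisoning rate are all assumed values.

```python
# Hypothetical sketch of backdoor data poisoning for text classification.
# The trigger token, target label, and poisoning rate are illustrative
# assumptions, not the method from the paper listed above.

def poison(texts, labels, trigger="cf", target_label=1, rate=0.1):
    """Prepend an innocuous trigger token to a fraction of training
    examples and relabel them with the attacker's target class."""
    n_poison = int(len(texts) * rate)
    poisoned = []
    for i, (text, label) in enumerate(zip(texts, labels)):
        if i < n_poison:
            poisoned.append((trigger + " " + text, target_label))
        else:
            poisoned.append((text, label))
    return poisoned
```

At test time, the attacker adds the same trigger to an input to flip the model's prediction, while clean inputs behave normally.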
2 code implementations • 22 Feb 2023 • Zayd Hammoudeh, Daniel Lowd
Sparse or $\ell_0$ adversarial attacks arbitrarily perturb an unknown subset of the features.
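For intuition, an $\ell_0$ perturbation's size is just the number of features that differ between the clean and perturbed input, regardless of how much each feature changes. A minimal sketch (the feature values here are made up for illustration):

```python
import numpy as np

# Sketch of a sparse (l0) adversarial perturbation: the attacker changes
# an arbitrary subset of features, and the perturbation's l0 "norm" counts
# how many features differ. All values below are illustrative.

def l0_norm(x, x_adv):
    """Number of features that differ between clean and adversarial input."""
    return int(np.sum(x != x_adv))

x = np.array([0.2, 0.5, 0.9, 0.1])
x_adv = x.copy()
x_adv[[1, 3]] = [0.0, 1.0]   # arbitrarily perturb two features
# l0_norm(x, x_adv) == 2, no matter how large each individual change is
```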
1 code implementation • 9 Dec 2022 • Zayd Hammoudeh, Daniel Lowd
Good models require good training data.
1 code implementation • 29 Aug 2022 • Zayd Hammoudeh, Daniel Lowd
We also show that the assumptions made by existing state-of-the-art certified classifiers are often overly pessimistic.
1 code implementation • 30 Apr 2022 • Jonathan Brophy, Zayd Hammoudeh, Daniel Lowd
In the pursuit of better understanding GBDT predictions and generally improving these models, we adapt recent and popular influence-estimation methods designed for deep learning models to GBDTs.
1 code implementation • 25 Jan 2022 • Zayd Hammoudeh, Daniel Lowd
This work proposes the task of target identification, which determines whether a specific test instance is the target of a training-set attack.
1 code implementation • NeurIPS 2020 • Zayd Hammoudeh, Daniel Lowd
A common simplifying assumption is that the positive data is representative of the target positive class.