Search Results for author: Anas Awadalla

Found 7 papers, 4 papers with code

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use

1 code implementation12 Aug 2023 Yonatan Bitton, Hritik Bansal, Jack Hessel, Rulin Shao, Wanrong Zhu, Anas Awadalla, Josh Gardner, Rohan Taori, Ludwig Schmidt

These descriptions enable 1) collecting human-verified reference outputs for each instance; and 2) automatic evaluation of candidate multimodal generations using a text-only LLM, aligning with human judgment.

Instruction Following

Are aligned neural networks adversarially aligned?

no code implementations NeurIPS 2023 Nicholas Carlini, Milad Nasr, Christopher A. Choquette-Choo, Matthew Jagielski, Irena Gao, Anas Awadalla, Pang Wei Koh, Daphne Ippolito, Katherine Lee, Florian Tramer, Ludwig Schmidt

We show that existing NLP-based optimization attacks are insufficiently powerful to reliably attack aligned text models: even when current NLP-based attacks fail, we can find adversarial inputs with brute force.

Reliable and Trustworthy Machine Learning for Health Using Dataset Shift Detection

no code implementations NeurIPS 2021 Chunjong Park, Anas Awadalla, Tadayoshi Kohno, Shwetak Patel

We then translate the out-of-distribution score into a human interpretable CONFIDENCE SCORE to investigate its effect on the users' interaction with health ML applications.

BIG-bench Machine Learning Medical Diagnosis +1

Cannot find the paper you are looking for? You can Submit a new open access paper.