Search Results for author: Qinan Yu

Found 4 papers, 2 papers with code

Grokking Group Multiplication with Cosets

no code implementations • 11 Dec 2023 • Dashiell Stander, Qinan Yu, Honglu Fan, Stella Biderman

We use the group Fourier transform over the symmetric group $S_n$ to reverse engineer a 1-layer feedforward network that has "grokked" the multiplication of $S_5$ and $S_6$.

Paper
Add Code

Characterizing Mechanisms for Factual Recall in Language Models

no code implementations • 24 Oct 2023 • Qinan Yu, Jack Merullo, Ellie Pavlick

By scaling up or down the value vector of these heads, we can control the likelihood of using the in-context answer on new data.

counterfactual

Paper
Add Code

Are Language Models Worse than Humans at Following Prompts? It's Complicated

1 code implementation • 17 Jan 2023 • Albert Webson, Alyssa Marie Loo, Qinan Yu, Ellie Pavlick

However, recent work finds that models can perform surprisingly well when given intentionally irrelevant or misleading prompts.

Paper
Code

Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

1 code implementation • 20 Dec 2022 • Martha Lewis, Nihal V. Nayak, Peilin Yu, Qinan Yu, Jack Merullo, Stephen H. Bach, Ellie Pavlick

In this work, we focus on the ability of a large pretrained vision and language model (CLIP) to encode compositional concepts and to bind variables in a structure-sensitive way (e. g., differentiating ''cube behind sphere'' from ''sphere behind cube'').

Language Modelling Open-Ended Question Answering

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.