Search Results for author: Ionnis Konstas

Found 1 papers, 0 papers with code

N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

no code implementations • 22 Apr 2023 • Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Fazl Barez

Understanding the function of individual neurons within language models is essential for mechanistic interpretability research.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.