Search Results for author: Ionnis Konstas

Found 1 paper, 0 papers with code

N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

no code implementations • 22 Apr 2023 • Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Fazl Barez

Understanding the function of individual neurons within language models is essential for mechanistic interpretability research.
