1 code implementation • 5 Feb 2024 • Vinitra Swamy, Syrielle Montariol, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser
Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i. e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance.