Recent posts
Stable Explanations with Multiplicative Smoothing
Achieving stability guarantees for feature attribution methods
Do Machine Learning Models Learn Statistical Rules Inferred from Data?
Understanding and improving model predictions using rules learned from data.
Faithful Chain-of-Thought Reasoning
A novel prompting method that provides faithful explanations and improves performance on complex reasoning tasks
In-context Learning Influences
An influence framework for studying in-context learning examples
Adversarial Prompting
Finding phrases that bias image and text generation towards unexpected outputs