S. Lapuschkin

Papers on Lattice

Total citations

Topics

h-index

Research focus

Interpretability & Mechanistic Interp (3)Natural Language Processing (1)Eval Frameworks & Benchmarks (1)Computer Vision (1)Multimodal Models (1)

Frequent co-authors

Wojciech Samek (3)S. Ostermann (1)Daniil Gurgurov (1)Tanja Baeumel (1)

Papers (3)

Apr 15, 2026

From Weights to Activations: Is Steering the Next Frontier of Adaptation?

Steering isn't just a trick; it's a fundamentally different way to adapt language models, offering localized, reversible control that traditional fine-tuning can't match.

S. Ostermann, Daniil Gurgurov, Tanja Baeumel +9

Interpretability & Mechanistic Interp Natural Language Processing

Mar 31, 2026

Mohammad Mesgari +3Mar 31, 2026

Structural Compactness as a Complementary Criterion for Explanation Quality

Forget IoU, measuring the structural compactness of attribution maps with Minimum Spanning Trees reveals fundamental differences in how models explain themselves.

Mohammad Mesgari, Wojciech Samek, S. Lapuschkin +1

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp

May 26, 2025

May 26, 2025·also Technical University of Berlin

From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance

CLIP models exhibit surprising reliance on latent components encoding polysemous words, visual typography, and dataset artifacts, revealing hidden biases that can be amplified in downstream tasks.

Maximilian Dreyer, Lorenz Hufe, J. Berend +3

Computer Vision Interpretability & Mechanistic Interp Multimodal Models

Search

S. Lapuschkin

Research focus

Frequent co-authors

Papers (3)