Can Rager

University College London

Papers on Lattice

Total citations

Topics

Research focus

Interpretability & Mechanistic Interp (2)Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Matthew Kowal (2)Vasudev Shyam (2)Sheridan Feucht (2)Usha Bhalla (2)

Papers (2)

May 6, 2026

May 6, 2026·also Stanford HAI, Harvard, Northeastern

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

Steering neural networks through the intrinsic geometry of their activations unlocks more natural and controllable behaviors than traditional linear interventions.

Can Rager, Matthew Kowal, Vasudev Shyam +12

Interpretability & Mechanistic Interp

Apr 30, 2026

Stanford HAIApr 30, 2026·also Northeastern, UCL

Do Sparse Autoencoders Capture Concept Manifolds?

Sparse autoencoders, despite their popularity for extracting interpretable features, often fail to capture the underlying manifold structure of concepts, instead fragmenting them across multiple, diluted features.

Usha Bhalla, Usha Bhalla, Thomas Fel +20

Architecture Design (Transformers, SSMs, MoE)Interpretability & Mechanistic Interp

Search

Can Rager

Research focus

Frequent co-authors

Papers (2)