S. Ostermann

German Research Center for Artificial Intelligence (DFKI), Saarbrücken, Germany

Papers on Lattice

Total citations

Topics

h-index

Research focus

Interpretability & Mechanistic Interp (2)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Natural Language Processing (1)

Frequent co-authors

Dan Shi (1)Renren Jin (1)Josef van Genabith (1)Deyi Xiong (1)

Papers (2)

Apr 27, 2026

Apr 27, 2026·also DFKI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

RL's superior generalization isn't about brute force, but about carefully sculpting a few key features while preserving the base model's knowledge, unlike SFT's rapid specialization.

Dan Shi, S. Ostermann, Renren Jin +2

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 15, 2026

From Weights to Activations: Is Steering the Next Frontier of Adaptation?

Steering isn't just a trick; it's a fundamentally different way to adapt language models, offering localized, reversible control that traditional fine-tuning can't match.

S. Ostermann, Daniil Gurgurov, Tanja Baeumel +9

Interpretability & Mechanistic Interp Natural Language Processing

Search

S. Ostermann

Research focus

Frequent co-authors

Papers (2)