Josef van Genabith

German Research Center for Artificial Intelligence (DFKI), Saarbrücken, Germany

Papers on Lattice

Total citations

Topics

h-index

Research focus

Interpretability & Mechanistic Interp (1)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

Dan Shi (1)S. Ostermann (1)Renren Jin (1)Deyi Xiong (1)

Papers (1)

Apr 27, 2026

Apr 27, 2026·also DFKI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

RL's superior generalization isn't about brute force, but about carefully sculpting a few key features while preserving the base model's knowledge, unlike SFT's rapid specialization.

Dan Shi, S. Ostermann, Renren Jin +2

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Josef van Genabith

Research focus

Frequent co-authors

Papers (1)