Dan Shi

TJUNLP Lab, School of Computer Science and Technology, Tianjin University, China, College of Intelligence and Computing, Tianjin University

Papers on Lattice

Total citations

Topics

h-index

Research focus

Interpretability & Mechanistic Interp (1)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

S. Ostermann (1)Renren Jin (1)Josef van Genabith (1)Deyi Xiong (1)

Papers (1)

Apr 27, 2026

Apr 27, 2026·also DFKI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

RL's superior generalization isn't about brute force, but about carefully sculpting a few key features while preserving the base model's knowledge, unlike SFT's rapid specialization.

Dan Shi, S. Ostermann, Renren Jin +2

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Dan Shi

Research focus

Frequent co-authors

Papers (1)