Deyi Xiong

TJUNLP Lab, School of Computer Science and Technology, Tianjin University, China

Research focus

Interpretability & Mechanistic Interp (1)
Reasoning & Chain-of-Thought (1)
RLHF & Preference Learning (1)

Frequent co-authors

Dan Shi (1)
S. Ostermann (1)
Renren Jin (1)
Josef van Genabith (1)

Papers (1)

Apr 27, 2026 · also DFKI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

RL's superior generalization comes not from brute force but from carefully sculpting a few key features while preserving the base model's knowledge; SFT, by contrast, specializes rapidly.

Dan Shi, S. Ostermann, Renren Jin +2

Interpretability & Mechanistic Interp · Reasoning & Chain-of-Thought · RLHF & Preference Learning
