Lattice AI Research

Research focus

Reasoning & Chain-of-Thought (2)RLHF & Preference Learning (2)Interpretability & Mechanistic Interp (1)Training Efficiency & Optimization (1)

Frequent co-authors

Dan Shi (1)S. Ostermann (1)Josef van Genabith (1)Deyi Xiong (1)

Papers (2)

Apr 27, 2026

Apr 27, 2026·also DFKI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

RL's superior generalization isn't about brute force, but about carefully sculpting a few key features while preserving the base model's knowledge, unlike SFT's rapid specialization.

Dan Shi, S. Ostermann, Renren Jin +2

Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 14, 2026

Apr 14, 2026·also Baidu, CAS

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Forget brute-force hinting: KnowRL distills knowledge into atomic units, then uses subset selection to find the *least* amount of guidance needed to supercharge LLM reasoning.

Linhao Yu, Tianmeng Yang, Renren Jin +7

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization

Search

Renren Jin

Research focus

Frequent co-authors

Papers (2)