Yuxin Wu

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Training Efficiency & Optimization (1)Interpretability & Mechanistic Interp (1)RLHF & Preference Learning (1)

Frequent co-authors

Kimi Team (1)Guangyu Chen (1)Jianlin Su (1)Weixin Xu (1)

Papers (2)

Mar 16, 2026

Mar 16, 2026·also Cohere, Moonshot

Attention Residuals

Forget fixed residual connections: Attention Residuals let each layer selectively attend to previous layers, boosting performance and gradient flow in deep LLMs.

Kimi Team, Guangyu Chen, Jianlin Su +30

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Feb 15, 2026

DAMOFeb 15, 2026·also CAS

Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric

Ditch the black-box reward function: this new rubric-based RL framework uses LLMs to judge responses against interpretable criteria, offering a more robust and transparent approach to alignment.

Ruipeng Jia, Yunyi Yang, Yuxin Wu +2

Interpretability & Mechanistic Interp RLHF & Preference Learning Scalable Oversight & Alignment Theory

Search

Yuxin Wu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)