Yanru Chen

Provable adversarial repair of Transformers is now possible beyond the last layer, thanks to a new framework that formulates repair as a tractable convex optimization problem.

Hsin-Ling Hsu, Minyu Chen, Nai-Chia Chen +3

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing Red-Teaming & Adversarial Robustness

Mar 16, 2026

Mar 16, 2026·also Cohere, Moonshot, UCSD, Xidian

Attention Residuals

Forget fixed residual connections: Attention Residuals let each layer selectively attend to previous layers, boosting performance and gradient flow in deep LLMs.

Kimi Team, Jianlin Su, Weixin Xu +28

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Search

Yanru Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)