¹University of Surrey  ²Stanford University  ³University of Notre Dame  Stanford HAI
*Correspondence: f.neri@surrey.ac.uk, zwang43@nd.edu
LoRA weights already encode how an adapter will behave and perform: a new method predicts both directly from the weights, without running the base model or accessing the training data.
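A minimal sketch of the general idea, not the paper's actual method: summarize each layer's effective LoRA update B·A with simple spectral statistics that a lightweight regressor could map to adapter behavior or accuracy. The feature choices and function names below are illustrative assumptions.

```python
import numpy as np

def lora_features(adapters):
    """Summarize a LoRA adapter as a fixed-length feature vector.

    adapters: list of (A, B) pairs with A of shape (r, d_in) and
    B of shape (d_out, r), so the effective per-layer weight
    update is delta_W = B @ A.
    """
    feats = []
    for A, B in adapters:
        delta_w = B @ A                        # effective weight update
        s = np.linalg.svd(delta_w, compute_uv=False)
        feats.extend([
            s[0],                              # spectral norm
            s.sum(),                           # nuclear norm
            np.linalg.norm(delta_w),           # Frobenius norm
            float((s > 1e-6).sum()),           # effective rank
        ])
    return np.array(feats)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy adapter: a rank-8 LoRA update for one 64x64 layer.
    adapters = [(rng.normal(size=(8, 64)), rng.normal(size=(64, 8)))]
    print(lora_features(adapters))
```

Features like these could then be fit against adapters with known scores, so that new adapters are scored from their weights alone.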
By warming up residual connections layer by layer, ProRes makes language-model pretraining faster and more stable.
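A minimal sketch of layer-wise residual warmup under stated assumptions: each block's residual branch is scaled by a gate that ramps linearly from 0 to 1, with deeper layers opening later. The block structure and linear schedule are illustrative choices, not ProRes's published recipe.

```python
import torch
import torch.nn as nn

class WarmedResidualBlock(nn.Module):
    """Feed-forward block whose residual branch opens on a schedule."""

    def __init__(self, d_model, layer_idx, num_layers, warmup_steps):
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(d_model, 4 * d_model), nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )
        self.layer_idx = layer_idx
        self.num_layers = num_layers
        self.warmup_steps = warmup_steps

    def gate(self, step):
        # Deeper layers start warming later; each ramps linearly to 1.
        start = self.layer_idx / self.num_layers * self.warmup_steps
        return min(max((step - start) / self.warmup_steps, 0.0), 1.0)

    def forward(self, x, step):
        # At gate == 0 the block is an identity; at 1 it is a
        # standard residual block.
        return x + self.gate(step) * self.ff(x)
```

Because every branch starts fully gated off, early training sees a shallow, near-identity network, which is one plausible reading of why such warmup stabilizes optimization.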
Diffusion language models (DLMs) aren't truly parallel because their training data is too sequential; NAP shows how data curation can enable genuine parallel decoding and boost reasoning performance.
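NAP itself is a data-curation method, but to make the target concrete, here is a generic sketch of confidence-based parallel decoding in a masked DLM, the regime such curation aims to enable: at every step, commit all masked positions whose predicted confidence clears a threshold. The `model` signature, `mask_id`, and threshold are assumptions, not NAP's setup.

```python
import torch

@torch.no_grad()
def parallel_decode(model, tokens, mask_id, threshold=0.9, max_steps=32):
    """tokens: (seq_len,) LongTensor with mask_id at undecided slots.

    Assumes model(batch) returns logits of shape (batch, seq_len, vocab).
    """
    for _ in range(max_steps):
        masked = tokens == mask_id
        if not masked.any():
            break
        logits = model(tokens.unsqueeze(0)).squeeze(0)
        conf, pred = logits.softmax(-1).max(-1)
        # Commit every masked position above threshold in one step;
        # if none qualifies, commit only the single most confident one.
        commit = masked & (conf >= threshold)
        if not commit.any():
            idx = torch.where(masked, conf, torch.zeros_like(conf)).argmax()
            commit[idx] = True
        tokens = torch.where(commit, pred, tokens)
    return tokens
```

The fewer steps the loop needs before all masks are resolved, the more genuinely parallel the decoding; highly sequential training data tends to push the model back toward one-token-per-step commits.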