Lattice AI Research

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Training Efficiency & Optimization (2)Scaling Laws & Emergent Abilities (1)Distributed Systems & Hardware (1)

Frequent co-authors

Mingze Wang (1)Shuchen Zhu (1)Binghui Li (1)Kai Shen (1)

Papers (2)

May 26, 2026

3w ago

Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models

Scale vectors, despite being a tiny fraction of LLM parameters, are critical for pre-training, and this paper unlocks how to make them even better with simple, theoretically-grounded tweaks.

Mingze Wang, Shuchen Zhu, Yuxin Fang +3

Architecture Design (Transformers, SSMs, MoE)Scaling Laws & Emergent Abilities Training Efficiency & Optimization

Mar 16, 2026

Tsinghua AIMar 16, 2026·also ByteDance, PKU, SJTU

Mixture-of-Depths Attention

LLMs can now scale depth more effectively: a new attention mechanism recovers diluted features in deeper layers, boosting performance with negligible overhead.

Lianghui Zhu, Yuxin Fang, Bencheng Liao +8

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Search

Yuxin Fang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)