Training massive LLMs on a single GPU is now possible, potentially democratizing access to large-scale model development.
Forget auxiliary losses and fixed expert capacity: Expert Threshold routing dynamically allocates computation in MoEs and balances expert load, all while boosting data efficiency by 1.6x.
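The blurb above does not spell out how threshold routing works, but the general idea behind threshold-based MoE gating can be sketched as follows: instead of sending every token to a fixed top-k of experts, each token is routed to every expert whose gate probability clears a threshold, so "hard" tokens get more compute and "easy" tokens get less. The function and parameter names below (`threshold_route`, `tau`) are hypothetical, and the fallback-to-top-1 rule is an assumption, not a detail from the source.

```python
import numpy as np

def threshold_route(gate_logits, tau=0.2):
    # Hypothetical sketch of threshold routing: route each token to
    # every expert whose softmax gate probability exceeds tau.
    # Assumption: if no expert clears tau, fall back to the top-1
    # expert so every token is processed at least once.
    z = gate_logits - gate_logits.max(axis=-1, keepdims=True)
    probs = np.exp(z) / np.exp(z).sum(axis=-1, keepdims=True)
    routes = []
    for p in probs:
        chosen = np.where(p > tau)[0]
        if chosen.size == 0:  # fallback: top-1 expert
            chosen = np.array([int(p.argmax())])
        routes.append(chosen)
    return probs, routes

# Two tokens, four experts: the first token's gate mass is
# concentrated on one expert, the second is nearly uniform.
logits = np.array([[4.0, 0.0, 0.0, 0.0],
                   [0.1, 0.0, 0.1, 0.0]])
probs, routes = threshold_route(logits, tau=0.2)
print([r.tolist() for r in routes])  # → [[0], [0, 1, 2, 3]]
```

Note how the confident token uses a single expert while the ambiguous token fans out to all four; this per-token variability is what a fixed top-k (or fixed expert capacity) cannot express.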