Forget static policies: Autopoiesis uses LLMs to continuously rewrite serving policy code, adapting to runtime dynamics in ways human-designed systems can't.
Multi-round LLM inference gets a major speed boost with AMPD, a new disaggregated serving framework that intelligently manages interleaved prefill-decode workloads.
Video diffusion models run 18.6x faster at 97% attention sparsity with a learned router that decides when to apply sparse versus linear attention and how to combine them, outperforming heuristic approaches.