Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Jianjiang Li | Lattice

Jianjiang Li

University of Science and Technology Beijing

Papers on Lattice

1

Total citations

0

Topics

3

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Chia-chi Hsieh (1)Chia-chi Hsieh (1)Zan Zong (1)Zan Zong (1)

Papers (1)

Feb 18, 2026

Tsinghua AIFeb 18, 2026·also HIT, TJU, USTB

FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

LLM serving can achieve 5.6x higher throughput without sacrificing latency by decoupling preemption granularity from scheduling frequency.

Chia-chi Hsieh, Chia-chi Hsieh, Zan Zong +9

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Xinyang Chen (1)

Xinyang Chen (1)

Xinyang Chen (1)

Jianjiang Li (1)