Quan Chen

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Zhenghao Gan (1)Z. Gan (1)Yichen Bao (1)Yifei Liu (1)

Papers (1)

Mar 9, 2026

Tsinghua AIMar 9, 2026

SageSched: Efficient LLM Scheduling Confronting Demand Uncertainty and Hybridity

Beat the LLM inference bottleneck: SageSched's uncertainty-aware scheduling boosts efficiency by nearly 30% by predicting output length and balancing compute and memory demands.

Zhenghao Gan, Z. Gan, Yichen Bao +4

Distributed Systems & Hardware Inference & Quantization

Search

Quan Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)