Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Jiawei Jiang | Lattice

Jiawei Jiang

Papers on Lattice

1

Total citations

0

Topics

2

h-index

0

Research focus

Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Haoyu Zheng (1)Yongqiang Zhang (1)Fangcheng Fu (1)Xiaokai Zhou (1)

Papers (1)

Apr 1, 2026

Haoyu Zheng +9Apr 1, 2026·also WHU

Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions

Stop guessing how long LLM outputs will be – modeling the *distribution* of possible lengths slashes latency by 2x and boosts throughput by 40%.

Haoyu Zheng, Yongqiang Zhang, Fangcheng Fu +7

Distributed Systems & Hardware Inference & Quantization

Hongchao Zhu (1)

Yuanyuan Zhu (1)