Weimin Xiong

School of Computer Science, Peking University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Inference & Quantization (1)Scaling Laws & Emergent Abilities (1)

Frequent co-authors

Jiebin Zhang (1)Zhenghan Yu (1)Eugene J. Yu (1)Zheng Li (1)

Papers (1)

Jun 1, 2026

Jun 1, 2026·also Key Laboratory of Computational, Tencent AI, UIUC

DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding

DFlare achieves up to 5.52x speedup in LLM inference by allowing draft layers to independently leverage richer target knowledge, breaking through previous capacity constraints.

Jiebin Zhang, Zhenghan Yu, Eugene J. Yu +6

Inference & Quantization Scaling Laws & Emergent Abilities

Search

Weimin Xiong

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)