Dawei Zhu

School of Computer Science, Peking University

Research focus

Inference & Quantization (2), Scaling Laws & Emergent Abilities (1), RLHF & Preference Learning (1)

Frequent co-authors

Jiebin Zhang (2), Zhenghan Yu (2), Eugene J. Yu (2), Yifan Song (2)

Papers (2)

Jun 1, 2026 · also Key Laboratory of Computational, Tencent AI, UIUC

DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding

DFlare achieves up to 5.52x speedup in LLM inference by allowing draft layers to independently leverage richer target knowledge, breaking through previous capacity constraints.

Jiebin Zhang, Zhenghan Yu, Eugene J. Yu +6

Inference & Quantization · Scaling Laws & Emergent Abilities

Mar 2, 2026 · also Tsinghua AI, Key Laboratory of Computational, NUDT, PKU +1

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning

Speculative decoding gets a throughput boost of up to 4.32x by using reinforcement learning to dynamically balance drafting and verification.

Jiebin Zhang, Zhenghan Yu, Eugene J. Yu +5

Inference & Quantization · RLHF & Preference Learning
