Training speculative decoding models just got an order of magnitude faster, unlocking real-world deployment with a new open-source framework and a suite of production-ready draft models.
PromptTuner cuts SLO violations by up to 7.9x and costs by up to 4.5x when tuning LLM prompts, outperforming existing resource-management systems.