Search papers, labs, and topics across Lattice.
Training speculative decoding models just got an order of magnitude faster, unlocking real-world deployment with a new open-source framework and a suite of production-ready draft models.
LLMs can now autonomously generate and deploy GPU kernels into production LLM engines, thanks to a new standardized framework for benchmarking and integrating these AI-generated kernels.