Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Guang Huang | Lattice

Guang Huang

Papers on Lattice

1

Total citations

0

Topics

3

h-index

0

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Zeyi Wen (1)Zeyi Wen (1)

Papers (1)

Mar 2, 2026

Guang Huang +22w ago

Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification

Quantization can halve memory traffic during speculative decoding's verification stage, boosting end-to-end throughput by 28% without retraining.

Guang Huang, Zeyi Wen, Zeyi Wen

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization