Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Lefei Zhang | Lattice

Lefei Zhang

School of Computer Science, Wuhan University, Wuhan, China

Papers on Lattice

2

Total citations

0

Topics

3

h-index

6

Research focus

Inference & Quantization (1)Natural Language Processing (1)Recommendation & Information Retrieval (1)

Frequent co-authors

Zihong Zhang (1)Zihong Zhang (1)Z. Li (1)Zuchao Li (1)

Papers (2)

Apr 16, 2026

Apr 16, 2026·also BUPT, Edinburgh, PKU

RACER: Retrieval-Augmented Contextual Rapid Speculative Decoding

LLM inference gets a 2x speed boost without training, thanks to a clever technique that merges retrieval with logit-based speculation.

Zihong Zhang, Zihong Zhang, Z. Li +4

Inference & Quantization Natural Language Processing Recommendation & Information Retrieval

Feb 6, 2026

NVIDIAFeb 6, 2026·also WHU

Phase transitions in large language model compression

Structural, numerical, and algebraic redundancy across pruning, quantization, and low-rank decomposition techniques are analyzed, enabling a criticality-aware compression framework that achieves near-lossless compression to 10% of the original size.

Ziyang Ma, Zuchao Li, Lefei Zhang +4

Lefei Zhang (1)