Forget sparse KV caches: QuantSpec's hierarchical 4-bit quantization unlocks 2.5x speedups in long-context LLM inference with >90% acceptance rates.
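QuantSpec's actual hierarchical scheme is described in the paper itself; purely as an illustration of the underlying idea, here is a minimal sketch of plain asymmetric 4-bit quantization applied to a toy KV-cache tensor. All names and the per-tensor scale/zero-point layout are illustrative assumptions, not QuantSpec's method.

```python
import numpy as np

def quantize_4bit(x: np.ndarray):
    # Per-tensor asymmetric 4-bit quantization: map floats to integers in [0, 15].
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / 15 if hi > lo else 1.0
    q = np.clip(np.round((x - lo) / scale), 0, 15).astype(np.uint8)
    return q, scale, lo

def dequantize_4bit(q: np.ndarray, scale: float, lo: float) -> np.ndarray:
    # Reconstruct approximate floats from the 4-bit codes.
    return q.astype(np.float32) * scale + lo

# Example: quantize a toy KV-cache slice and measure reconstruction error.
kv = np.random.default_rng(0).standard_normal((4, 8)).astype(np.float32)
q, s, z = quantize_4bit(kv)
kv_hat = dequantize_4bit(q, s, z)
max_err = float(np.abs(kv - kv_hat).max())  # bounded by scale / 2 for in-range values
```

Storing codes in 4 bits (two per byte after packing) cuts KV-cache memory roughly 4x versus fp16, which is the memory-bandwidth saving that speculative schemes like QuantSpec exploit during long-context decoding.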