Chris Lott

Papers on Lattice

Total citations

Topics

Research focus

Inference & Quantization (2)Natural Language Processing (1)Training Efficiency & Optimization (1)Architecture Design (Transformers, SSMs, MoE) (1)Interpretability & Mechanistic Interp (1)

Frequent co-authors

Raghavv Goel (1)Mukul Gagrani (1)Mingu Lee (1)Christopher M. Lott (1)

Papers (2)

Mar 18, 2026

Raghavv Goel +4Mar 18, 2026

Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing

LLMs can predict multiple tokens in parallel without any training, simply by cleverly probing their embedding space with dynamically generated mask tokens.

Raghavv Goel, Mukul Gagrani, Mingu Lee +2

Inference & Quantization Natural Language Processing Training Efficiency & Optimization

Mar 8, 2026

Sudhanshu Agrawal +2Mar 8, 2026

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

Diffusion language models have surprisingly redundant early layers, enabling nearly 20% FLOPs reduction at inference time via layer skipping without sacrificing performance.

Sudhanshu Agrawal, Chris Lott, Fatih Porikli

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Interpretability & Mechanistic Interp

Search

Chris Lott

Research focus

Frequent co-authors

Papers (2)