Lattice: the structure behind the noise

Created by Flynn Lachendro

John Harvill

University of Illinois at Urbana-Champaign

Papers on Lattice: 1

Total citations: 0

Topics: 3

h-index: 7

Research focus

Architecture Design (Transformers, SSMs, MoE) (1) · Inference & Quantization (1) · Natural Language Processing (1)

Frequent co-authors

Zihao Xu (1), John Harvill (1), Ziwei Fan (1), Yizhou Sun (1)

Papers (1)

Amazon Science · Apr 16, 2026 · also JHU, UIUC

Compressing Sequences in the Latent Embedding Space: $K$-Token Merging for Large Language Models

Achieve 75% input length reduction in LLMs with minimal performance loss by compressing token embeddings directly in the latent space.

Zihao Xu, John Harvill, John Harvill +4

Architecture Design (Transformers, SSMs, MoE) · Inference & Quantization · Natural Language Processing