Trainable INT8 attention can match full-precision attention during pre-training, but only if you normalize the queries and keys (QK-norm) and reduce the number of tokens per step.