Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Zhiwen Mo | Lattice

Zhiwen Mo

Papers on Lattice

1

Total citations

0

Topics

3

h-index

0

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Yiren Zhao (1)Robert D. Mullins (1)

Papers (1)

Feb 12, 2026

Feb 12, 2026

Deep Kernel Fusion for Transformers

Fusing kernels in SwiGLU MLP blocks slashes memory bandwidth bottlenecks, yielding up to 13.2% speedups on H100 GPUs during agentic LLM inference.

Zhiwen Mo, Yiren Zhao, Robert D. Mullins

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization