Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Yiren Zhao | Lattice

Yiren Zhao

Imperial College London

Papers on Lattice

1

Total citations

0

Topics

3

h-index

13

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

Zhiwen Mo (1)Robert D. Mullins (1)

Papers (1)

Feb 12, 2026

Feb 12, 2026

Deep Kernel Fusion for Transformers

Fusing kernels in SwiGLU MLP blocks slashes memory bandwidth bottlenecks, yielding up to 13.2% speedups on H100 GPUs during agentic LLM inference.

Zhiwen Mo, Yiren Zhao, Robert D. Mullins

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization