Shouxu Lin

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)
Distributed Systems & Hardware (1)
Inference & Quantization (1)

Frequent co-authors

Zhiyuan Guo (1)
Jiaxin Lin (1)

Papers (1)

Apr 28, 2026

Shouxu Lin +2, 3w ago

DAK: Direct-Access-Enabled GPU Memory Offloading with Optimal Efficiency for LLM Inference

Forget prefetching: DAK unlocks up to 3x faster LLM inference by enabling direct GPU access to remote memory, achieving near-optimal system bandwidth utilization.

Shouxu Lin, Zhiyuan Guo, Jiaxin Lin

Architecture Design (Transformers, SSMs, MoE)
Distributed Systems & Hardware
Inference & Quantization
