FlashMem enables mobile GPUs to run large DNNs and multi-DNN workloads efficiently, cutting memory consumption by up to 8.4x and speeding up inference by up to 75x.