On-device LLM performance is heavily influenced by sequence length and model depth. Hardware heterogeneity creates efficiency traps, which architectural refinements such as Multi-head Latent Attention (MLA) can mitigate.
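One reason MLA helps on memory-constrained devices is that it caches a single low-rank latent per token instead of full per-head keys and values, shrinking the KV cache that grows with sequence length. The sketch below illustrates that idea in NumPy; all dimensions, weight names, and the random weights are hypothetical choices for illustration, not values from the source.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes (not from the source): model dim 64, 4 heads,
# head dim 16, and a latent KV width of 8 (the compressed cache).
d_model, n_heads, d_head, d_latent = 64, 4, 16, 8
seq_len = 5

# Random matrices standing in for trained parameters.
W_q   = rng.standard_normal((d_model, n_heads * d_head)) / np.sqrt(d_model)
W_dkv = rng.standard_normal((d_model, d_latent)) / np.sqrt(d_model)      # down-projection
W_uk  = rng.standard_normal((d_latent, n_heads * d_head)) / np.sqrt(d_latent)  # up-project to K
W_uv  = rng.standard_normal((d_latent, n_heads * d_head)) / np.sqrt(d_latent)  # up-project to V

x = rng.standard_normal((seq_len, d_model))

# MLA core idea: cache only the small latent c_kv per token; expand to
# per-head K and V on the fly via the up-projections.
c_kv = x @ W_dkv                                     # (seq_len, d_latent) -- what gets cached
q = (x @ W_q).reshape(seq_len, n_heads, d_head)
k = (c_kv @ W_uk).reshape(seq_len, n_heads, d_head)
v = (c_kv @ W_uv).reshape(seq_len, n_heads, d_head)

# Standard scaled dot-product attention with a causal mask.
scores = np.einsum("qhd,khd->hqk", q, k) / np.sqrt(d_head)
mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
scores = np.where(mask, -np.inf, scores)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)
out = np.einsum("hqk,khd->qhd", weights, v).reshape(seq_len, n_heads * d_head)

# Cache footprint in floats: full K+V per head vs. the shared latent.
full_kv_cache = seq_len * n_heads * d_head * 2   # 640 for standard MHA
mla_cache = seq_len * d_latent                   # 40 with the latent cache
print(out.shape, full_kv_cache, mla_cache)
```

Here the cache shrinks by a factor of `n_heads * d_head * 2 / d_latent` (16x under these made-up dimensions); a real MLA implementation also handles positional encodings separately, which this sketch omits.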