Zhongzhi Luan

Sino-German Joint Software Institute, Beihang University

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Distributed Systems & Hardware (2); Training Efficiency & Optimization (1); Architecture Design (Transformers, SSMs, MoE) (1); Inference & Quantization (1)

Frequent co-authors

Shiqing Ma (2), Jiaxing Qi (2), Bin Han (2), Hailong Yang (2)

Papers (2)

Jun 9, 2026

RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms

RATrain achieves a 1.35× speedup in training large language models on bandwidth-limited supercomputers, challenging the assumption that high-bandwidth environments are necessary for optimal performance.

Yao Lu, Shiqing Ma, Zhongzhi Luan +5

Distributed Systems & Hardware; Training Efficiency & Optimization

May 25, 2026

Bandwidth-Aware LLM Inference on Heterogeneous Many-Core Supercomputers

LLM inference on supercomputers doesn't have to be a bottleneck: THInfer achieves up to 84% higher throughput than A800 GPUs by co-designing hardware-aware kernels and a communication-optimized pipeline.

Zhongzhi Luan, Jiaxing Qi, Shiqing Ma +4

Architecture Design (Transformers, SSMs, MoE); Distributed Systems & Hardware; Inference & Quantization
