Luhong Liang

Papers on Lattice

Total citations

Topics

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Distributed Systems & Hardware (2)Inference & Quantization (2)

Frequent co-authors

Yuzhong Jiao (2)Songchen Ma (1)Hongyi Li (1)Weihao Zhang (1)

Papers (2)

Mar 29, 2026

Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling

Multi-chiplet architectures can unlock significant speedups and memory savings for low-batch MoE inference by dynamically scheduling expert computations across high-bandwidth die-to-die links.

Songchen Ma, Hongyi Li, Weihao Zhang +7

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Feb 24, 2026

Towards Secure and Efficient DNN Accelerators via Hardware-Software Co-Design

Securing DNN accelerators doesn't have to break the bank: this co-design framework slashes memory overhead by 87% while boosting performance by 12%.

Wei Xuan, Zihao Xuan, Rongliang Fu +9

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

Luhong Liang

Research focus

Frequent co-authors

Papers (2)