Kang Zhu

Nanjing University

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (1); Distributed Systems & Hardware (1); Inference & Quantization (1)

Frequent co-authors

Aojie Jiang (1), Zhiheng Zhang (1), Zhengxu Su (1), Zhe Su (1)

Papers (1)

Mar 30, 2026

2d ago · also China Mobile

A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network

Forget GPU-centric All-Reduce: SCIN's switch-based architecture slashes latency by up to 8.7x and boosts LLaMA-2 performance by 34% through in-network quantization.

Aojie Jiang, Kang Zhu, Zhiheng Zhang +5

Architecture Design (Transformers, SSMs, MoE); Distributed Systems & Hardware; Inference & Quantization
