LLM inference bottlenecks aren't just compute-bound: heterogeneous GPU-FPGA systems can cut memory-processing overhead by up to 2x while also reducing energy consumption.
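To see why inference is often memory-bound rather than compute-bound, a roofline-style back-of-the-envelope calculation helps. The sketch below is illustrative and not from any specific system: the function, the model width of 4096, and the A100-class peak numbers are all assumptions chosen for round figures. The key point is that autoregressive decode streams the full weight matrix to produce one token, so its arithmetic intensity sits far below the hardware ridge point.

```python
# Roofline-style sketch (illustrative assumptions, not measured data):
# estimate FLOPs-per-byte for a single d_model x d_model weight matmul.

def arithmetic_intensity(batch_tokens: int, d_model: int,
                         bytes_per_param: int = 2) -> float:
    """FLOPs moved per byte of weight traffic for one square-matrix matmul."""
    flops = 2 * batch_tokens * d_model * d_model        # one multiply-add = 2 FLOPs
    bytes_moved = bytes_per_param * d_model * d_model   # fp16 weights dominate traffic
    return flops / bytes_moved

# Hypothetical A100-class peaks (assumptions for illustration only).
PEAK_FLOPS = 312e12   # ~fp16 tensor-core throughput, FLOP/s
PEAK_BW = 2.0e12      # ~HBM bandwidth, bytes/s
ridge = PEAK_FLOPS / PEAK_BW  # intensity needed before compute becomes the limit

decode = arithmetic_intensity(batch_tokens=1, d_model=4096)     # one token at a time
prefill = arithmetic_intensity(batch_tokens=512, d_model=4096)  # whole prompt at once

print(f"ridge point: {ridge:.0f} FLOP/byte")
print(f"decode:      {decode:.0f} FLOP/byte (far below ridge -> memory-bound)")
print(f"prefill:     {prefill:.0f} FLOP/byte (above ridge -> compute-bound)")
```

With these assumed numbers, decode lands at roughly 1 FLOP/byte against a ridge point around 156, which is exactly the regime where offloading memory-side work to an FPGA can pay off.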