Yuang Chen

Chinese University of Hong Kong, Hong Kong, China

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (1), Inference & Quantization (1)

Frequent co-authors

Wenqi Zeng (1), Jeffrey Xu Yu (1)

Papers (1)

Jan 28, 2026 · also HKUST

High-Throughput Non-uniformly Quantized 3-bit LLM Inference

Naive quantization can paradoxically *slow down* LLM inference, but Quantix flips the script with 11x speedups via hardware-aware data layout and kernel fusion.

Yuang Chen, Wenqi Zeng, Jeffrey Xu Yu

Architecture Design (Transformers, SSMs, MoE), Inference & Quantization
