Get up to 10% more throughput on LLM disaggregation workloads by swapping in this drop-in collective communications library with built-in compression.
LLMs can now design GPU kernels that outperform both human experts and prior automated methods, thanks to a co-evolving world model that guides the search process.