Get up to 10% more throughput on LLM disaggregation workloads by swapping in this drop-in collective communications library with built-in compression.
Open-source LLMs can now autonomously optimize AI accelerator kernels, matching the performance of proprietary models at a fraction of the cost.