Cut LLM cold starts from minutes to seconds by pre-materializing CUDA graph execution contexts, sidestepping brittle kernel patching and heavyweight checkpointing.
Prompt learning can now scale 17x without sacrificing accuracy, unlocking efficient self-improvement for LLM agents.
LLM-driven program evolution gets a smart upgrade: AdaEvolve dynamically allocates resources to promising solution candidates, leaving static schedules in the dust.
LLMs can now design GPU kernels that outperform both human experts and prior automated methods, thanks to a co-evolving world model that guides the search process.