Forget hand-tuning: CODO automatically compiles efficient FPGA dataflow accelerators, delivering up to 33.8x speedups on DNN models compared to existing frameworks.
Serving LoRA adapters at scale doesn't have to crush your latency SLOs: InfiniLoRA disaggregates LoRA execution to achieve 3x higher throughput and dramatically improved tail latency.
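The paper's exact disaggregation scheme isn't reproduced here, but the property it builds on is simple: the LoRA contribution is an additive delta, y = W x + (alpha / r) * B(A x), so the base projection and the adapter projection can run on separate workers and be summed afterward. A minimal sketch, with illustrative worker names and shapes:

```python
# Minimal sketch (not InfiniLoRA's actual implementation): because the LoRA
# output is additive, the dense base path and the tiny adapter path can be
# computed by different workers and combined at the end.
import numpy as np

rng = np.random.default_rng(0)
d_model, r, alpha = 1024, 16, 32

W = rng.standard_normal((d_model, d_model))      # frozen base weight
A = rng.standard_normal((r, d_model)) * 0.01     # LoRA down-projection
B = rng.standard_normal((d_model, r)) * 0.01     # LoRA up-projection
x = rng.standard_normal(d_model)                 # one token's activations

def base_worker(x):
    # Runs on the GPU serving the dense base model.
    return W @ x

def lora_worker(x):
    # Runs on a separate worker holding only the small adapter weights.
    return (alpha / r) * (B @ (A @ x))

# The serving layer only has to add the two partial results.
y = base_worker(x) + lora_worker(x)

# Sanity check against the fused (merged-weight) computation.
y_fused = (W + (alpha / r) * (B @ A)) @ x
assert np.allclose(y, y_fused)
```

Because the adapter path touches only the small A and B matrices, it can be scheduled and scaled independently of the base model, which is the property a disaggregated serving design exploits.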
Semantic disagreement between LLMs reveals crucial uncertainty that single-model metrics miss, and Collaborative Entropy (CoE) captures it.
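The paper's precise CoE definition isn't shown here; the sketch below only illustrates the general idea under assumed details: pool answers from several LLMs, group them into semantic clusters, and take the entropy of the cluster distribution as an uncertainty signal that no single model's token probabilities expose. The equivalence check and function names are placeholders.

```python
# Illustrative sketch only; the paper's exact Collaborative Entropy (CoE)
# formulation may differ. Higher entropy over semantic clusters of answers
# from multiple models indicates cross-model disagreement.
import math

def semantically_equivalent(a: str, b: str) -> bool:
    # Placeholder equivalence check; a real system would use an NLI model
    # or embedding similarity rather than a normalized string match.
    return a.strip().lower() == b.strip().lower()

def collaborative_entropy(answers: list[str]) -> float:
    # Greedily assign each answer to the first cluster it matches.
    clusters: list[list[str]] = []
    for ans in answers:
        for cluster in clusters:
            if semantically_equivalent(ans, cluster[0]):
                cluster.append(ans)
                break
        else:
            clusters.append([ans])
    # Entropy of the cluster-size distribution (higher = more disagreement).
    n = len(answers)
    return -sum((len(c) / n) * math.log(len(c) / n) for c in clusters)

# Answers to the same question from, say, three different LLMs:
print(collaborative_entropy(["Paris", "paris", "Paris"]))     # 0.0   -> agreement
print(collaborative_entropy(["Paris", "Lyon", "Marseille"]))  # ~1.10 -> disagreement
```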
Exploit the surprisingly stable, yet heterogeneous, sparsity patterns across attention heads to slash LLM attention latency by 2.88x without sacrificing quality.
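The paper's mechanism isn't detailed here; as a hypothetical sketch of the idea, suppose each head is assigned its own sparsity budget (profiled or learned offline), and at decode time each head attends only to its head-specific top-k keys. All names, shapes, and budgets below are illustrative assumptions.

```python
# Hypothetical sketch (not the paper's implementation): heterogeneous,
# per-head sparsity. Heads that concentrate on a few positions get a small
# top-k budget; heads with flat attention keep more (or all) keys.
import numpy as np

rng = np.random.default_rng(1)
n_heads, seq_len, d_head = 4, 128, 64

q = rng.standard_normal((n_heads, d_head))            # current query per head
K = rng.standard_normal((n_heads, seq_len, d_head))   # cached keys per head
V = rng.standard_normal((n_heads, seq_len, d_head))   # cached values per head

# Per-head budgets, assumed to be stable across inputs (last head stays dense).
top_k = [8, 16, 32, 128]

def sparse_head_attention(h: int) -> np.ndarray:
    scores = K[h] @ q[h] / np.sqrt(d_head)                  # (seq_len,)
    keep = np.argpartition(scores, -top_k[h])[-top_k[h]:]   # head-specific top-k keys
    probs = np.exp(scores[keep] - scores[keep].max())       # softmax over kept keys
    probs /= probs.sum()
    return probs @ V[h][keep]                               # attend only to kept values

out = np.stack([sparse_head_attention(h) for h in range(n_heads)])
print(out.shape)  # (4, 64)
```

Since the per-head budgets are fixed rather than recomputed per query, the kept-key gather can be fused into the decode kernel, which is where the latency savings would come from.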