University of California, Merced
Squeeze up to 3.2x more performance from your long-context LLMs by intelligently splitting attention computation between CPU and GPU.
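Below is a minimal, hypothetical sketch of the idea described above: keep older KV-cache entries on the CPU, keep only the recent window on the GPU, run attention on each partition separately, and merge the partial results with a numerically stable log-sum-exp combine. Function names such as `partial_attention` and `merge_partials`, and the 3.2x speedup figure, are illustrative assumptions, not the project's actual API or a measured result of this snippet.

```python
import math
import torch

def partial_attention(q, k, v):
    """Attention over one KV partition; returns (unnormalized output, max score, exp-sum)."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.shape[-1])  # [heads, 1, len]
    m = scores.amax(dim=-1, keepdim=True)                      # per-head running max
    p = torch.exp(scores - m)                                  # shifted exponentials
    l = p.sum(dim=-1, keepdim=True)                            # partial softmax denominator
    o = p @ v                                                  # unnormalized partial output
    return o, m, l

def merge_partials(o1, m1, l1, o2, m2, l2):
    """Combine two partial attention results computed over disjoint key sets."""
    m = torch.maximum(m1, m2)
    a1, a2 = torch.exp(m1 - m), torch.exp(m2 - m)
    return (o1 * a1 + o2 * a2) / (l1 * a1 + l2 * a2)

heads, d, ctx, recent = 8, 64, 4096, 512
gpu = "cuda" if torch.cuda.is_available() else "cpu"           # fall back to CPU-only if needed

q = torch.randn(heads, 1, d)        # single decoding query
k = torch.randn(heads, ctx, d)      # full KV cache (toy data)
v = torch.randn(heads, ctx, d)

# Older tokens stay on the CPU; only the recent window is moved to the GPU.
o_cpu, m_cpu, l_cpu = partial_attention(q, k[:, :-recent], v[:, :-recent])
o_gpu, m_gpu, l_gpu = partial_attention(
    q.to(gpu), k[:, -recent:].to(gpu), v[:, -recent:].to(gpu)
)

# Merge on the CPU; the result equals full softmax attention over the whole cache.
out = merge_partials(o_cpu, m_cpu, l_cpu, o_gpu.cpu(), m_gpu.cpu(), l_gpu.cpu())
print(out.shape)  # torch.Size([8, 1, 64])
```

Because the two partitions cover disjoint keys, the log-sum-exp merge reproduces exact attention over the full context, so the CPU/GPU split changes where the work happens, not the output.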