Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (1) · Natural Language Processing (1) · Recommendation & Information Retrieval (1) · Architecture Design (Transformers, SSMs, MoE) (1) · Distributed Systems & Hardware (1)

Frequent co-authors

Deli Huang (1) · Cunguang Wang (1) · Hongyin Tang (1) · Zhengyang Tang (1)

Papers (2)

May 27, 2026

Deli Huang +14 · May 27, 2026 · also Meituan

ATLAS: All-round Testing of Long-context Abilities across Scales

Rankings of long-context LLMs reshuffle dramatically when models are evaluated across a range of context lengths and capabilities, showing that a single headline score can be misleading.

Deli Huang, Cunguang Wang, Hongyin Tang +12

Eval Frameworks & Benchmarks · Natural Language Processing · Recommendation & Information Retrieval

Apr 9, 2026

Apr 9, 2026 · also Corresponding author

AsyncTLS: Efficient Generative LLM Inference with Asynchronous Two-level Sparse Attention

AsyncTLS achieves full-attention accuracy with a 10x operator speedup and a 4.7x throughput improvement in long-context LLM inference by overlapping KV cache transfers with computation.

Yuxuan Hu, Jianchao Tan, Jiaqi Zhang +6

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Inference & Quantization


Wen Zan
