Lattice AI Research

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)Data Curation & Synthetic Data (1)Eval Frameworks & Benchmarks (1)

Frequent co-authors

Chang Han (1)Jingling Liu (1)Jinglin Liu (1)Weiqi Zhai (1)

Papers (3)

Mar 9, 2026

Chang Han +3Mar 9, 2026

EAGLE-Pangu: Accelerator-Safe Tree Speculative Decoding on Ascend NPUs

Tree speculative decoding can achieve up to 2.46x speedup on Ascend NPUs, but only if you carefully manage the branch/commit cache and eliminate undefined negative indices.

Chang Han, Yijie Hu, Jingling Liu +1

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Feb 15, 2026

DAMOFeb 15, 2026·also MIT CSAIL, Tsinghua AI, Fudan, Huawei +1

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

LLM benchmark accuracy jumps 10% when evaluated on a cleaned-up version of Humanity's Last Exam, highlighting the significant impact of dataset noise on performance metrics.

Weiqi Zhai, Weiqi Zhai, Zhihai Wang +44

Data Curation & Synthetic Data Eval Frameworks & Benchmarks Natural Language Processing

Dec 31, 2025

Dec 31, 2025·also CAS, ECNU, Fudan, GIST Guangdong +6

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

An open-source ecosystem for agentic learning, complete with a trained agent and novel policy optimization, promises to accelerate research by providing a standardized, scalable platform.

Weixun Wang, Xiaoxiao Xu, Wanhe An +85

Open-Source Models & Weights Tool Use & Agents

Search

Yijie Hu

Research focus

Frequent co-authors

Papers (3)