Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (1)Red-Teaming & Adversarial Robustness (1)Tool Use & Agents (1)Architecture Design (Transformers, SSMs, MoE) (1)Code Generation & Program Synthesis (1)

Frequent co-authors

Bowen Ye (1)Rang Li (1)Qibin Yang (1)Yuanxin Liu (1)

Papers (2)

Apr 7, 2026

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Current autonomous agent benchmarks miss nearly half of safety violations and over 10% of robustness failures because they only check final outputs, a problem Claw-Eval directly addresses.

Bowen Ye, Rang Li, Qibin Yang +8

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness Tool Use & Agents

Sep 1, 2025

Dream-Coder 7B: An Open Diffusion Language Model for Code

Forget left-to-right: Dream-Coder 7B's diffusion approach lets it generate code in *any* order, adapting its strategy to the task at hand.

Zhihui Xie, Jiacheng Ye, Lin Zheng +833

Architecture Design (Transformers, SSMs, MoE)Code Generation & Program Synthesis Open-Source Models & Weights

Search

Zhihui Xie

Research focus

Frequent co-authors

Papers (2)