Lattice AI Research

Research focus

RLHF & Preference Learning (2), Tool Use & Agents (2), Code Generation & Program Synthesis (1), Reasoning & Chain-of-Thought (1)

Frequent co-authors

Cursor Research Aaron Chan (1), Ahmed Shalaby (1), Alexander Wettig (1), Aman Sanger (1)

Papers (2)

Mar 25, 2026

BAIR · also Microsoft Research, IIT

Composer 2 Technical Report

Training domain-specific coding LLMs with realistic environments and large-scale RL can yield substantial gains in practical software engineering tasks.

Cursor Research Aaron Chan, Ahmed Shalaby, Alexander Wettig +51

Code Generation & Program Synthesis, RLHF & Preference Learning, Tool Use & Agents

Mar 7, 2026

Hindsight Credit Assignment for Long-Horizon LLM Agents

LLM agents can learn to solve complex, long-horizon tasks much more effectively by using themselves as post-hoc critics to refine Q-values through hindsight reasoning.

Huihan Tan, Xiao-Wen Yang, Hao Chen +7

Reasoning & Chain-of-Thought, RLHF & Preference Learning, Tool Use & Agents
