Andrew Tao

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (2)RLHF & Preference Learning (1)Training Efficiency & Optimization (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Yejin Choi (3)Ximing Lu (2)Saurav Muralidharan (2)Karan Sapra (2)

Papers (3)

Jun 16, 2026

AI22d ago·also NVIDIA, UIUC

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

ZPPO reveals that embedding teacher responses in prompts rather than gradients can dramatically boost the performance of small student models on challenging tasks.

Byung-Kwan Lee, Ximing Lu, Shizhe Diao +8

RLHF & Preference Learning Training Efficiency & Optimization

Jun 15, 2026

AI23d ago·also NVIDIA, UCSD

ProCUA-SFT Technical Report

Fine-tuning on the new ProCUA-SFT dataset boosts UI-TARS 7B's performance from a dismal 8-10% to an impressive 45.0% on OSWorld tasks, highlighting the critical role of high-quality training data.

Jaehun Jung, Ximing Lu, Brandon Cui +11

Data Curation & Synthetic Data Tool Use & Agents

Jun 12, 2026

AI26d ago·also NVIDIA, HKUST, Institute of Medical Technology, Motional +3

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Achieving six times the inference throughput of current LLMs while maintaining accuracy, Nemotron 3 Ultra redefines performance benchmarks for agentic reasoning tasks.

NVIDIA, Aaron Blakeman, Aaron Thomas +570

Architecture Design (Transformers, SSMs, MoE)Scaling Laws & Emergent Abilities Tool Use & Agents

Search

Andrew Tao

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)