Karan Sapra

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (1)Training Efficiency & Optimization (1)Data Curation & Synthetic Data (1)Tool Use & Agents (1)

Frequent co-authors

Ximing Lu (2)Andrew Tao (2)Yejin Choi (2)Byung-Kwan Lee (1)

Papers (2)

Jun 16, 2026

AI22d ago·also NVIDIA, UIUC

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

ZPPO reveals that embedding teacher responses in prompts rather than gradients can dramatically boost the performance of small student models on challenging tasks.

Byung-Kwan Lee, Ximing Lu, Shizhe Diao +8

RLHF & Preference Learning Training Efficiency & Optimization

Jun 15, 2026

AI23d ago·also NVIDIA, UCSD

ProCUA-SFT Technical Report

Fine-tuning on the new ProCUA-SFT dataset boosts UI-TARS 7B's performance from a dismal 8-10% to an impressive 45.0% on OSWorld tasks, highlighting the critical role of high-quality training data.

Jaehun Jung, Ximing Lu, Brandon Cui +11

Data Curation & Synthetic Data Tool Use & Agents

Search

Karan Sapra

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)