Byung-Kwan Lee

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (1)Training Efficiency & Optimization (1)Multimodal Models (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Yu-Chiang Frank Wang (2)Ximing Lu (1)Shizhe Diao (1)Minki Kang (1)

Papers (2)

Jun 16, 2026

AI2Jun 16, 2026·also NTU Taiwan

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

ZPPO reveals that embedding teacher responses in prompts rather than gradients can dramatically boost the performance of small student models on challenging tasks.

Byung-Kwan Lee, Ximing Lu, Shizhe Diao +4

RLHF & Preference Learning Training Efficiency & Optimization

Jun 11, 2026

Seokju Cho +19Jun 11, 2026·also NVIDIA, KAIST, NTU Taiwan

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

SpatialClaw enables agents to dynamically compose and adapt their reasoning strategies, achieving a remarkable 11.2-point accuracy boost over traditional spatial agents.

Seokju Cho, Seokju Cho, Ryo Hachiuma +17

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

Search

Byung-Kwan Lee

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)