Zhang-Wei Hong

Improbable AI Lab, MIT-IBM Computing Research Lab

MIT CSAIL

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (2)Natural Language Processing (1)RLHF & Preference Learning (1)Inference & Quantization (1)

Frequent co-authors

Ryan Bahlous-Boldi (1)Isha Puri (1)Idan Shenfeld (1)Akarsh Kumar (1)

Papers (2)

May 21, 2026

Improbable AI LabMay 21, 2026·also MIT CSAIL

Vector Policy Optimization: Training for Diversity Improves Test-Time Search

LLMs trained with Vector Policy Optimization (VPO) learn to produce diverse solutions that unlock previously unsolvable problems in evolutionary search, outperforming models optimized for single scalar rewards.

Ryan Bahlous-Boldi, Isha Puri, Idan Shenfeld +6

Natural Language Processing RLHF & Preference Learning Tool Use & Agents

Apr 6, 2026

MIT CSAILApr 6, 2026·also Stanford HAI, Improbable AI Lab, UIUC, University of California

Decocted Experience Improves Test-Time Inference in LLM Agents

Forget brute-force scaling: crafting the *right* context from past experiences unlocks surprisingly large gains in LLM agent performance.

Maohao Shen, Kaiwen Zha, Zexue He +6

Inference & Quantization Reasoning & Chain-of-Thought Tool Use & Agents