Haixin Wang

Research focus

Tool Use & Agents (2), Training Efficiency & Optimization (2), RLHF & Preference Learning (1), Robotics & Embodied AI (1)

Frequent co-authors

Hejie Cui (1), Chenwei Zhang (1), Shuowei Jin (1), Shijie Geng (1)

Papers (2)

May 4, 2026

Haixin Wang +8 · May 4, 2026 · also HKU

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Multi-turn RL agents can learn far more effectively by explicitly monitoring and controlling uncertainty at both the token and turn levels, leading to more stable training and higher performance.

Haixin Wang, Hejie Cui, Chenwei Zhang +6

RLHF & Preference Learning, Tool Use & Agents, Training Efficiency & Optimization

Feb 25, 2026

Feb 25, 2026 · also UCLA

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

ARLArena reveals the hidden instability of agentic RL, offering a path to more reliable LLM-based agents via a novel stable policy optimization method (SAMPO).

Xiaoxuan Wang, Xiaoxuan Wang, Haixin Wang +15

Robotics & Embodied AI, Tool Use & Agents, Training Efficiency & Optimization
