Hejie Cui

Papers on Lattice

Total citations

Topics

Research focus

RLHF & Preference Learning (1)Tool Use & Agents (1)Training Efficiency & Optimization (1)

Frequent co-authors

Chenwei Zhang (1)Shuowei Jin (1)Shijie Geng (1)Xinyang Zhang (1)

Papers (1)

May 4, 2026

Hejie Cui +7May 4, 2026·also HKU

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Multi-turn RL agents can learn far more effectively by explicitly monitoring and controlling uncertainty at both the token and turn levels, leading to more stable training and higher performance.

Hejie Cui, Chenwei Zhang, Shuowei Jin +5

RLHF & Preference Learning Tool Use & Agents Training Efficiency & Optimization

Search

Hejie Cui

Research focus

Frequent co-authors

Papers (1)