Yingqi Xie

Papers on Lattice

Total citations

Topics

Research focus

RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Yiran Guo (1)Zhongjian Qiao (1)Dan Ye (1)Lijie Xu (1)

Papers (1)

Feb 15, 2026

Tsinghua AIFeb 15, 2026·also CAS, M steps for a fair comparison.

Deep Dense Exploration for LLM Reinforcement Learning via Pivot-Driven Resampling

By strategically resampling from deep, recoverable states ("pivots") within unsuccessful trajectories, DDE drastically improves LLM reinforcement learning compared to methods that oversample from the root or blindly disperse budgets.

Yiran Guo, Zhongjian Qiao, Yingqi Xie +2

RLHF & Preference Learning Tool Use & Agents

Search

Yingqi Xie

Research focus

Frequent co-authors

Papers (1)