Xiaodong Lu

Papers on Lattice

Total citations

Topics

Research focus

Data Curation & Synthetic Data (1)RLHF & Preference Learning (1)Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Xiaohan Wang (1)Yikun Ban (1)Jiajun Chai (1)Tianhao Peng (1)

Papers (1)

May 25, 2026

May 25, 2026·also NTU

When Self-Belief Misleads: Active Label Acquisition for Reinforcement Learning with Verifiable Rewards

Overcome the prohibitive cost of ground-truth labels in reinforcement learning by actively acquiring labels for only the most valuable samples, leading to stable training and improved performance even with limited annotation budgets.

Xiaodong Lu, Xiaohan Wang, Yikun Ban +3

Data Curation & Synthetic Data RLHF & Preference Learning Scalable Oversight & Alignment Theory

Search

Xiaodong Lu

Research focus

Frequent co-authors

Papers (1)