Yun Xiong

Papers on Lattice

Total citations

Topics

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Siwei Zhang (1)Zi'an Jia (1)Renhong Huang (1)Jiarong Xu (1)

Papers (1)

Mar 3, 2026

Siwei Zhang +4Mar 3, 2026·also Fudan, ZJU

RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

LLM agents can explore more effectively by retrieving and reasoning over off-policy step-level traces, leading to significant performance gains and faster training.

Siwei Zhang, Yun Xiong, Zi'an Jia +2

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Yun Xiong

Research focus

Frequent co-authors

Papers (1)