Xiaolong Jin

Papers on Lattice

Total citations

Topics

h-index

Research focus

RLHF & Preference Learning (2)Tool Use & Agents (2)Reasoning & Chain-of-Thought (1)Code Generation & Program Synthesis (1)

Frequent co-authors

Wenxuan Jiang (1)Yuxin Zuo (1)Xuecheng Wu (1)Zi-Qian Fan (1)

Papers (2)

Apr 1, 2026

Wenxuan Jiang +6Apr 1, 2026·also Zhongguancun Laboratory

TR-ICRL: Test-Time Rethinking for In-Context Reinforcement Learning

LLMs can achieve massive performance gains on reasoning and knowledge-intensive tasks simply by iteratively refining their answers using pseudo-labels derived from unlabeled data.

Wenxuan Jiang, Yuxin Zuo, Xuecheng Wu +4

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Mar 10, 2026

Yinjie Wang +3Mar 10, 2026·also Princeton

OpenClaw-RL: Train Any Agent Simply by Talking

Forget finetuning on curated datasets – OpenClaw-RL lets agents learn directly and continuously from *every* interaction, turning user replies, tool outputs, and even GUI changes into valuable RL signals.

Yinjie Wang, Xiaolong Jin, Mengdi Wang +1

Code Generation & Program Synthesis RLHF & Preference Learning Tool Use & Agents

Search

Xiaolong Jin

Research focus

Frequent co-authors

Papers (2)