LLM agents can internalize skills via in-context RL, achieving zero-shot autonomous behavior without the token overhead and retrieval noise of traditional methods.
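To make the mechanism concrete, here is a minimal sketch of the kind of in-context RL loop this alludes to. The `llm` and `env` interfaces, the prompt format, and `format_history` are illustrative assumptions, not the paper's actual setup:

```python
# Minimal sketch of an in-context RL loop: the agent's only learning signal
# is the transition history accumulated in its own prompt. The `llm`/`env`
# interfaces and the prompt format are assumptions for illustration.

def format_history(history):
    """Render past (observation, action, reward) transitions as prompt text."""
    return "\n".join(
        f"Observation: {o}\nAction: {a}\nReward: {r}" for o, a, r in history
    )

def in_context_rl(llm, env, episodes=10):
    """Improve behavior across episodes with no weight updates and no
    external retrieval: experience lives entirely in the context window."""
    history = []  # the agent's only memory
    for _ in range(episodes):
        obs, done = env.reset(), False
        while not done:
            prompt = format_history(history) + f"\nObservation: {obs}\nAction:"
            action = llm(prompt).strip()           # policy = conditioned LLM
            obs, reward, done = env.step(action)   # environment transition
            history.append((obs, action, reward))  # internalized, not retrieved
    return history
```

The contrast with retrieval is that nothing is fetched per step: the accumulated transitions condition the next action directly, which is where the claimed savings in tokens and retrieval noise would come from.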
Forget hand-tuning rollout budgets: $V_{0.5}$ dynamically allocates compute to sparse RL rollouts based on a real-time statistical test of a generalist value model's prior, slashing variance and boosting performance.
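A back-of-the-envelope sketch of that allocation rule, assuming a simple two-sided z-test of the empirical return against the value prior; the function names, thresholds, and the final blending step are illustrative choices, not the paper's actual $V_{0.5}$ procedure:

```python
import math
from statistics import mean, stdev

def allocate_rollouts(prompt, value_prior, run_rollout,
                      min_rollouts=2, max_rollouts=16, z_crit=1.96):
    """Spend rollouts only where the empirical return disagrees with the
    generalist value model's prior. Hypothetical sketch: `value_prior` and
    `run_rollout` are caller-supplied stand-ins, and the z-test is an
    assumed form of the "real-time statistical test"."""
    prior = value_prior(prompt)                        # value model's estimate
    returns = [run_rollout(prompt) for _ in range(min_rollouts)]
    while len(returns) < max_rollouts:
        se = stdev(returns) / math.sqrt(len(returns))  # std. error of the mean
        z = abs(mean(returns) - prior) / max(se, 1e-8)
        if z < z_crit:       # prior is consistent with the evidence so far:
            break            # stop early and bank the saved compute
        returns.append(run_rollout(prompt))            # disagreement: sample more
    # Shrink the noisy empirical mean toward the prior in proportion to how
    # little evidence was gathered (a simple variance-reduction step).
    w = len(returns) / max_rollouts
    return w * mean(returns) + (1 - w) * prior
```

Under this reading, sampling stops as soon as the test fails to reject the prior, so easy prompts consume the minimum budget while hard, high-variance ones receive the full allocation.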