Guanzheng Chen

Research focus

Reasoning & Chain-of-Thought (1), RLHF & Preference Learning (1), Tool Use & Agents (1)

Frequent co-authors

Guanzheng Chen (1), Michael Shieh (1), Michael Qizhe Shieh (1), Lidong Bing (1)

Papers (1)

Mar 2, 2026

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards

Reinforcement learning struggles with long contexts because the reward signal for *finding* the right information vanishes; that signal can be revived by directly rewarding the model for selecting relevant context.

Guanzheng Chen, Michael Shieh +2

Reasoning & Chain-of-Thought, RLHF & Preference Learning, Tool Use & Agents
