Junhong Wu

University of Chinese Academy of Sciences, Chinese Academy of Sciences

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Jingcheng Deng (1)Zihao Wei (1)Liang Pang (1)Shicheng Xu (1)

Papers (1)

Apr 30, 2026

1d ago

Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning

Latent reasoning, previously unstable in RL, can now outperform explicit reasoning while using 3-4x shorter chains, thanks to a new method that stabilizes latent space exploration.

Jingcheng Deng, Zihao Wei, Liang Pang +4

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Junhong Wu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)