Junhong Wu

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Jingcheng Deng (1)Zihao Wei (1)Liang Pang (1)Shicheng Xu (1)

Papers (1)

Apr 30, 2026

Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning

Latent reasoning can now outperform explicit reasoning in complex tasks, thanks to a new RL method that stabilizes training by explicitly handling issues like invalid latent states and misaligned token-level updates.

Jingcheng Deng, Zihao Wei, Liang Pang +4

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Junhong Wu

Research focus

Frequent co-authors

Papers (1)