Yun Luo

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (2)Inference & Quantization (1)Training Efficiency & Optimization (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Qingyang Zhang (1)Xinke Kong (1)Haitao Wu (1)Qinghua Hu (1)

Papers (2)

Apr 21, 2026

Qingyang Zhang +9Apr 21, 2026

TEMPO: Scaling Test-time Training for Large Reasoning Models

Test-time training can finally scale for large reasoning models: TEMPO unlocks sustained performance gains by interleaving policy refinement with periodic critic recalibration, boosting accuracy by over 18% on challenging benchmarks.

Qingyang Zhang, Xinke Kong, Haitao Wu +7

Inference & Quantization Reasoning & Chain-of-Thought Training Efficiency & Optimization

Feb 12, 2026

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

LLMs can be taught to "think longer" and explore more diverse reasoning paths in-context via a simple length-incentivized reward, leading to improved generalization.

Futing Wang, Jianhao Yan, Yun Luo +5

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Yun Luo

Research focus

Frequent co-authors

Papers (2)