LLMs can be taught to "think longer" and explore more diverse reasoning paths in-context via a simple length-incentivized reward, leading to improved generalization.
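A length-incentivized reward of this kind can be sketched as a correctness reward plus a small, capped bonus for longer reasoning traces. This is an illustrative assumption, not the paper's exact scheme: the weight, the cap, and the token count are all hypothetical parameters.

```python
def reward(is_correct: bool, num_reasoning_tokens: int,
           length_bonus_weight: float = 0.0001,
           max_bonus_tokens: int = 4096) -> float:
    """Correctness reward plus a small bonus for longer reasoning,
    capped so the model cannot farm reward with unbounded output."""
    base = 1.0 if is_correct else 0.0
    # Bonus is capped and kept well below the correctness reward,
    # so longer thinking is encouraged but never substitutes for it.
    bonus = length_bonus_weight * min(num_reasoning_tokens, max_bonus_tokens)
    return base + bonus
```

Capping the bonus below the correctness reward is one way to keep the incentive from dominating: a wrong answer with a very long trace still scores lower than a short correct one.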
RLVR training leaves a tell-tale sign: prompts encountered during fine-tuning produce unusually similar reasoning trajectories, detectable without access to model internals.
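The detection idea above can be sketched in a black-box setting: sample several completions for a prompt, measure how similar the trajectories are, and flag prompts whose samples are unusually alike. The Jaccard word-overlap metric and the threshold below are illustrative assumptions, not the paper's method.

```python
from itertools import combinations

def jaccard(a: str, b: str) -> float:
    """Word-set overlap between two sampled trajectories (illustrative metric)."""
    sa, sb = set(a.split()), set(b.split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)

def mean_pairwise_similarity(samples: list[str]) -> float:
    """Average similarity over all pairs of sampled trajectories."""
    pairs = list(combinations(samples, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)

def likely_seen_in_training(samples: list[str], threshold: float = 0.8) -> bool:
    """Flag a prompt whose sampled trajectories are suspiciously similar.
    The threshold is a hypothetical value; in practice it would be
    calibrated against prompts known to be outside the training set."""
    return mean_pairwise_similarity(samples) > threshold
```

Only sampled text is needed, which is what makes the test possible without access to model internals: near-duplicate traces across independent samples suggest the prompt was seen during fine-tuning, while diverse traces suggest it was not.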