Beijing Institute of Technology
Forget monolithic policies: splitting an LLM's RL policy into an accuracy-focused mode and an exploration-driven mode unlocks better performance and diversity.
Open-source 7B LLMs can now rival GPT-4o on validation tasks, thanks to a novel reinforcement learning approach that uses calibrated self-evaluation as a dense reward signal.
LLMs ace the setup but fumble the execution in mathematical modeling, revealing a critical gap that scaling alone will not fix.