Heterogeneous agents can boost each other's performance in RL without coordinated deployment, achieving better results with less data than traditional methods.
Large reasoning models (LRMs) already know when to stop reasoning, but current sampling methods are holding them back.
Stop overfitting your reward model: R2M leverages real-time policy feedback to dynamically align the reward model with the evolving policy distribution, reducing reward overoptimization in RLHF.
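The teaser doesn't spell out R2M's actual procedure, so the following is only a minimal, hypothetical sketch of the general idea it gestures at: periodically fine-tuning the reward model on fresh pairs drawn from the current policy, so the reward model's training distribution tracks the policy's evolving output distribution instead of a frozen preference dataset. All class and function names here are illustrative assumptions, not R2M's API.

```python
# Hypothetical sketch of refreshing a reward model on on-policy samples.
# This is NOT the R2M algorithm; it only illustrates the general idea of
# keeping the reward model aligned with the evolving policy distribution.
import torch
import torch.nn as nn
import torch.nn.functional as F


class RewardModel(nn.Module):
    """Toy scalar reward head over fixed-size response features."""

    def __init__(self, dim: int = 16):
        super().__init__()
        self.head = nn.Linear(dim, 1)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.head(features).squeeze(-1)


def refresh_reward_model(rm, chosen, rejected, lr=1e-3, steps=10):
    """Fine-tune the reward model on fresh on-policy preference pairs.

    `chosen` / `rejected` are feature tensors for preferred and
    dispreferred responses sampled from the *current* policy (assumed
    to be labeled by some oracle or annotator between RLHF updates).
    """
    opt = torch.optim.Adam(rm.parameters(), lr=lr)
    for _ in range(steps):
        margin = rm(chosen) - rm(rejected)
        # Bradley-Terry pairwise loss, the standard RLHF reward objective
        loss = -F.logsigmoid(margin).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return rm


if __name__ == "__main__":
    rm = RewardModel()
    # Stand-ins for features of fresh generations from the current policy
    chosen, rejected = torch.randn(32, 16), torch.randn(32, 16)
    refresh_reward_model(rm, chosen, rejected)
```

Interleaving such refreshes with policy updates, rather than training the reward model once up front, is one way to limit the distribution shift that drives reward overoptimization; how R2M derives its "real-time policy feedback" is not specified in the blurb above.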