Haobo Wang

Zhejiang University, The Chinese University of Hong Kong ♦, Eastern Institute of Technology

Papers on Lattice

Total citations

Topics

Papers (3)

Jun 7, 2026

1w ago·also CUHK, Eastern Institute of Technology, Tencent AI

ISPO reduces critical reasoning failures in RLVR by transforming reward structures, leading to superior performance on complex reasoning tasks.

Jun 4, 2026

1w ago·also Ant Group, CUHK, Eastern Institute of Technology

OPRD closes the performance gap between student and teacher models while training 1.44x faster and using 54% less memory than traditional methods.

NUS1w ago·also CUHK, Eastern Institute of Technology, ZJU

SkillComposer enables language models to self-evolve skills in real-time, achieving up to +4.5 improvements on agent tasks compared to larger models.