Lumeng Wu

The University of Hong Kong

Papers on Lattice

Total citations

Topics

Research focus

Multimodal Models (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Liujie Zhang (1)Benzhe Ning (1)Xiaoyan Yu (1)Weihang Chen (1)

Papers (1)

Apr 13, 2026

Apr 13, 2026·also HKU, NTU, USTC

Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Omni-modal RL post-training just got a whole lot faster: Relax delivers up to 2x speedups over existing systems, even for massive MoE models, without sacrificing reward convergence.

Liujie Zhang, Benzhe Ning, Xiaoyan Yu +3

Multimodal Models RLHF & Preference Learning Tool Use & Agents

Search

Lumeng Wu

Research focus

Frequent co-authors

Papers (1)