Bowen Ping

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (1)RLHF & Preference Learning (1)

Frequent co-authors

Xiangxin Zhou (1)Penghui Qi (1)Minnan Luo (1)Liefeng Bo (1)

Papers (1)

Jun 9, 2026

NUS2d ago·also Tencent AI, XJTU

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Flow-DPPO outperforms traditional PPO methods by achieving higher rewards and greater training stability through a novel divergence proximal constraint.

Bowen Ping, Xiangxin Zhou, Penghui Qi +3

Multimodal Models RLHF & Preference Learning

Search

Bowen Ping

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)