Weijie Qiu

Beijing University of Posts and Telecommunications

Papers on Lattice

Total citations

Topics

Research focus

Multimodal Models (1)RLHF & Preference Learning (1)

Frequent co-authors

Dai Guan (1)Junxin Wang (1)

Papers (1)

Mar 17, 2026

Mar 17, 2026·also DAMO, CAS

Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models

Forget expensive LLM-as-judge checks: Proxy-GRM learns transferable rubrics for vision-language reward models with a lightweight proxy, achieving SOTA results with 4x less data.

Weijie Qiu, Dai Guan, Junxin Wang

Multimodal Models RLHF & Preference Learning

Search

Weijie Qiu

Research focus

Frequent co-authors

Papers (1)