Lattice: the structure behind the noise

Created by Flynn Lachendro

Xiaomin Yu

Papers on Lattice: 1

Total citations: 0

Topics: 4

h-index: 1

Research focus

Multimodal Models (1) · RLHF & Preference Learning (1) · Robotics & Embodied AI (1) · Training Efficiency & Optimization (1)

Frequent co-authors

Sudong Wang (1) · Weiquan Huang (1) · Zuhao Yang (1) · Hehai Lin (1)

Papers (1)

Apr 30, 2026 · also Tsinghua AI

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Stop letting SFT ruin your LMMs: PRISM uses on-policy distillation to realign your model *before* RL, boosting performance by up to 6%.

Sudong Wang, Weiquan Huang, Xiaomin Yu +10

Multimodal Models · RLHF & Preference Learning · Robotics & Embodied AI · +1

Keming Wu (1)

Chaojun Xiao (1)