Xiaowen Zhang

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (1)Multimodal Models (1)RLHF & Preference Learning (1)

Frequent co-authors

Licheng Jiao (1)Qing Li (1)

Papers (1)

Feb 12, 2026

STVG-R1: Incentivizing Instance-Level Reasoning and Grounding in Videos via Reinforcement Learning

Forget complex cross-modal alignment: this method uses visual prompting with instance IDs and reinforcement learning to achieve a 20.9% m_IoU improvement on spatial-temporal video grounding.

Xiaowen Zhang, Licheng Jiao, Qing Li

Computer Vision Multimodal Models RLHF & Preference Learning

Search

Xiaowen Zhang

Research focus

Frequent co-authors

Papers (1)