Shanghai Jiao Tong University
Structured supervision can enable 3B VLMs to outperform 32B VLMs on dense-scene reasoning tasks, suggesting a path to efficient and reliable visual understanding.