Zhejiang Gongshang University
You can train a surprisingly effective medical imaging MLLM using *only* public data by generating knowledge-aware reasoning traces and enforcing self-consistency during reinforcement learning.
Current vision-language models are surprisingly bad at identifying common household safety hazards, but a new benchmark could change that.