MLLMs can get a surprising visual reasoning boost from a simple trick: adding just a dash of visually grounded self-supervision to instruction tuning.