Search papers, labs, and topics across Lattice.
1
0
3
VLMs can achieve superior visual reasoning by dynamically decomposing queries, extracting premise-conditioned visual latents, and reasoning through grounded rationales, outperforming even multimodal CoT methods.