By generating and compressing Chain-of-Thought reasoning, TRACE achieves state-of-the-art multimodal retrieval and learns to adaptively route queries through reasoning only when necessary, balancing accuracy and efficiency.
Training-free zero-shot image retrieval just got a whole lot better: WISER's "retrieve-verify-refine" pipeline achieves state-of-the-art results by intelligently fusing text-to-image and image-to-image retrieval.
By mimicking human visual attention, TraceVision significantly boosts spatial reasoning in vision-language models, outperforming existing methods on trajectory-guided tasks.