Search papers, labs, and topics across Lattice.
3
0
7
0
Forget scaling laws – this zero-shot navigation agent beats million-sample trained models by structurally unifying language, vision, and robot actions within the reasoning capabilities of pre-trained MLLMs.
Save up to 2.79x on LLM serving costs by intelligently distributing models across a diverse fleet of cloud GPUs.
Omni-modal LLMs can ace captioning and QA, but AVID reveals they're surprisingly bad at spotting audio-visual inconsistencies in videos, a crucial skill for trustworthy AI.