Search papers, labs, and topics across Lattice.
3
1
8
3
Achieve more reliable and visually informative multimodal reports by enforcing factual grounding, citation fidelity, and cross-modal consistency throughout the workflow with a multi-agent system.
Forget finetuning video models for each robot: a single, action-free video world model can drive diverse robots when paired with a carefully designed inverse dynamics model.
Forget random sampling – this framework crafts targeted, multi-turn function-calling data that catapults smaller LLMs to state-of-the-art performance.