Achieve real-time video understanding with transparent reasoning: the model aligns response timing with visual evidence, a step forward for online video LLMs.
By "dreaming ahead" with learned latent visual dynamics, LatentPilot achieves state-of-the-art vision-and-language navigation, demonstrating the power of future-aware reasoning without needing future observations at test time.