Search papers, labs, and topics across Lattice.
4
0
6
4
Latent reasoning can beat explicit Chain-of-Thought – but only if you force it to learn causal dynamics via a visual world model, not just language.
Token-level Mixture-of-Experts, directly ported from LLMs, can actually *hurt* autonomous driving performance in VLA models; SAMoE-VLA fixes this with scene-adaptive expert selection, achieving SOTA results with fewer parameters.
VLMs can now leverage the power of 3D geometric understanding for autonomous driving tasks thanks to a simple plug-and-play module.
By decoupling generation and refinement experts within a masked diffusion VLA model, DriveFine achieves both flexible decoding and self-correction for autonomous driving.