Ditch slow, irrelevant text-based reasoning: VISUALTHINK-VLA uses visual tokens to speed up vision-language-action policies by 22x while boosting accuracy.
LLMs stumble in multi-turn conversations not just because of context length, but because they poison themselves with their own past mistakes, and you can fix it with self-distillation.
InstructSAM equips SAM with high-level instruction understanding and compositional reasoning for multi-instance segmentation, all without modifying SAM's core architecture.