Search papers, labs, and topics across Lattice.
2
0
4
6
Achieve faster VLA inference without retraining by using image and attention entropy to dynamically focus on the most relevant visual and textual information.
VLMs stumble with confusable objects in robotic manipulation, but CAICL guides them to focus on the right features, boosting success rates by focusing on task-relevant features.