Search papers, labs, and topics across Lattice.
2
0
4
Forget training wheels: this training-free method leverages uncertainty to guide vision-language models to the right image regions, boosting performance on detail-oriented tasks.
Forget sparse annotations: a new dataset and compact VLM show that dense, complete screen parsing supervision unlocks substantial gains in UI understanding and grounding, even for large foundation models.