UNC Chapel Hill
Vision-language models (VLMs) confidently hallucinate answers to spatial reasoning questions even when visual evidence is occluded or misleading, and they perform near chance at identifying viewpoints that could resolve the ambiguity.
Decomposing GUI agent trajectories into verifiable milestones and auditing the evidence chain yields a 10% boost in RL training performance, outperforming single-judge reward systems.