Search papers, labs, and topics across Lattice.
2
0
5
Even state-of-the-art VLMs exhibit systematic failures in reasoning about the physical feasibility of actions in 3D environments, despite high semantic confidence.
A confidence-based gating mechanism lets a 14B parameter reward model outperform 70B parameter models, achieving a new accuracy-efficiency Pareto frontier.