Grounding boosts spatial reasoning in VLMs: explicitly linking language to 2D and 3D scene elements lets models decompose complex spatial problems and improve performance even on non-grounded tasks.
Get up to 1.79x faster ViT inference on high-resolution images without sacrificing accuracy by surgically replacing full-attention blocks with cheaper alternatives *after* pre-training.
Asynchronous RL for LLMs can be sped up 2.5x by explicitly controlling policy-gradient variance, without falling below the performance of synchronous training.