Search papers, labs, and topics across Lattice.
Beijing Institute of Technology
2
0
5
Robots get a spatial-temporal reasoning boost with STARRY, a world model that aligns future predictions with action generation, leading to a significant jump in manipulation success.
Ditch the mask decoder: a single segmentation token can unlock competitive image segmentation directly from MLLMs.