Search papers, labs, and topics across Lattice.
University of Science and Technology of China
2
0
4
0
Surgical robots can now perform complex tasks like knot tying and organ dissection with state-of-the-art accuracy, thanks to a new method that directly infers 3D spatial awareness from standard endoscopic images.
By encoding 3D point clouds in spherical coordinates, SoPE overcomes RoPE's limitations and significantly boosts spatial reasoning in 3D LVLMs.