Robots get a crucial boost in robustness by learning to "see" and predict how objects will move, not just react to the current frame.
DINO-VO's learned patch selection and differentiable bundle adjustment leapfrog traditional heuristic feature extraction, achieving SOTA monocular visual odometry with impressive generalization.
Teaching robots to manipulate objects just got easier: OCRA learns directly from human demonstration videos by focusing on object interactions and incorporating tactile feedback.
Forget trying to wrangle dynamic 4D scenes with recurrent networks: DynamicVGGT achieves state-of-the-art reconstruction accuracy using a surprisingly effective feed-forward approach.
Pre-training on universal 3D poses lets robots learn new tasks from just 100 demonstrations, sidestepping the usual VLA efficiency bottleneck.
MLLMs can "hear" a little, but EgoSound reveals they're still largely deaf to the nuances of sound in egocentric video, especially when it comes to spatial and causal reasoning.