Ye Pan

Robots can now learn complex manipulation tasks directly from human demonstrations using only a pair of smart glasses, achieving zero-shot transfer without specialized hardware.

Yanwen Zou, Yanwen Zou, Chenyang Shi +8

Computer Vision Multimodal Models Robotics & Embodied AI

Mar 12, 2026

Ye Pan +8Mar 12, 2026·also HKUST

EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next

Current MLLMs are surprisingly bad at understanding human intent in egocentric videos at a step-by-step level, achieving only 33% accuracy on a new benchmark designed to prevent future-frame leakage.

Ye Pan, Chi Kit Wong, Chifai Wong +6

Eval Frameworks & Benchmarks Multimodal Models Robotics & Embodied AI

Search

Ye Pan

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)