Zhongyuan Wang

Wuhan University, National Engineering Research Center for Multimedia Software, Hubei Key Laboratory of Multimedia and Network Communication Engineering

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Data Curation & Synthetic Data (1)Robotics & Embodied AI (1)

Frequent co-authors

Zhengqian Wu (1)Zhixian Liu (1)Aodong Chen (1)Jingyang Zhang (1)

Papers (2)

Jun 4, 2026

1w ago·also Hubei Key Laboratory of Multimedia and Network, National Engineering Research Center for Multimedia

StoryVideoQA: Scaling Deep Video Understanding with a Large-Scale, Multi-Genre and Auto-Generated Dataset

Current VideoQA models falter in understanding complex narratives, but StoryVideoQA and PlotTree redefine how we tackle deep video comprehension.

Zhengqian Wu, Zhixian Liu, Aodong Chen +6

Computer Vision Data Curation & Synthetic Data Multimodal Models

Apr 12, 2026

Apr 12, 2026·also BIT, BUPT, CAS, CAU +4

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Robots can now learn contact-rich manipulation skills like humans by feeling the forces involved, thanks to a new multimodal interface that captures synchronized visual, tactile, and force data.

Yuanyuan Li, Chaoran Xu, Jiachen Zhang +4

Multimodal Models Robotics & Embodied AI

Search

Zhongyuan Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)