Search papers, labs, and topics across Lattice.
Wuhan University, National Engineering Research Center for Multimedia Software, Hubei Key Laboratory of Multimedia and Network Communication Engineering
2
0
4
Current VideoQA models falter in understanding complex narratives, but StoryVideoQA and PlotTree redefine how we tackle deep video comprehension.
Robots can now learn contact-rich manipulation skills like humans by feeling the forces involved, thanks to a new multimodal interface that captures synchronized visual, tactile, and force data.