Beijing University of Posts and Telecommunications
VLMs can achieve state-of-the-art video recognition by splitting temporal modeling experts into specialized roles for spatial understanding and motion processing.