Search papers, labs, and topics across Lattice.
Control Science and Engineering, Zhejiang University
1
0
2
Diffusion models can bridge the semantic gap between abstract action labels and complex video content, leading to state-of-the-art performance in open-vocabulary temporal action detection.