Search papers, labs, and topics across Lattice.
X-LANCE Lab, Shanghai Jiao Tong University, China
1
0
3
0
Unlock SOTA audio understanding by jointly training on readily available clip-level descriptions and scarce frame-level annotations, bridging the gap between global semantics and local details.