Search papers, labs, and topics across Lattice.
Department of Information Engineering and Computer Science, University of Trento, Trento, Italy
1
0
3
Unlock human-interpretable video understanding without task-specific training: TF-SMOT leverages off-the-shelf vision-language models to achieve state-of-the-art semantic multi-object tracking.