Zhejiang University
ROGLE advances Text-Based Person Search by automatically generating fine-grained supervision, outperforming existing models on challenging long-form queries.
Seedance 2.0 leapfrogs existing models by unifying multi-modal inputs (text, image, audio, video) into a single architecture for generating high-quality, longer-duration audio-video content.