Search papers, labs, and topics across Lattice.
3
4
6
8
Forget paired video-music training data: V2M-Zero aligns video and music by matching the *timing* of changes within each modality, not the content itself.
Compressing 60-second audio into just 788 tokens, this new autoencoder makes generative audio modeling far more tractable by slashing encoding time and latent rates.
Forget RLHF and DPO – DRAGON lets you fine-tune generative models with rewards that compare entire *distributions* of outputs, unlocking better control and quality without human preference data.