Search papers, labs, and topics across Lattice.
3
0
7
Forget rigid shot-music mappings: BEAT's elastic alignment framework finally captures the dynamic rhythm of professional movie trailer editing.
By intelligently pruning attention heads based on their spatial or temporal roles and adaptively routing denoising steps through the network, PARE achieves significant computational savings in video generation without sacrificing quality.
Task-aware localization, using attention cues from both source and target image streams, significantly reduces over-editing in instruction-based image editing, even when applied to strong diffusion transformer backbones.