Search papers, labs, and topics across Lattice.
1
0
3
Text-to-video diffusion models often ignore crucial semantic tokens, but a simple re-weighting of training distributions and attention maps can dramatically improve generation quality.