Monash University
Generating realistic human-object interaction videos from text, images, audio, *and* pose is now possible, opening the door to automated content creation workflows.
Analytical diffusion models can now scale to ImageNet-1K without training, thanks to a clever "Golden Subset" selection strategy that avoids full-dataset scans.