Ditch the computational bloat: DeltaWorld slashes parameters by 35x and FLOPs by 2000x while generating more realistic video futures.
An end-to-end system extracts funny scenes from movies with 87% accuracy, opening new avenues for automated content repurposing.
Object hallucination in multimodal large language models (MLLMs) can be significantly reduced by simply masking salient visual features during contrastive decoding.