Search papers, labs, and topics across Lattice.
2
15
3
5
Ditching pixel-space translation unlocks a unified model (LatentUM) that reasons across modalities with SOTA results, opening doors to more efficient and aligned visual AI.
Achieve semantically coherent image compositions by mixing layout-focused and appearance-focused visual representations in a diffusion model's cross-attention.