Rutgers University
Ditch separate image and text encoders: UniFusion uses a single frozen VLM to generate and edit images, achieving better text-image alignment and zero-shot generalization.