Search papers, labs, and topics across Lattice.
ByteDance Seed, The Chinese University of Hong Kong
1
0
3
Ditch the VAE bottleneck: Representation Forcing lets you train unified multimodal models to generate high-quality images directly from pixels, rivaling VAE-based approaches without the architectural constraint.