Westlake University
Achieving up to 6.4x faster autoregressive image generation without sacrificing quality could redefine efficiency benchmarks in the field.
Freezing your VQ decoder during text-to-image post-training might be why your images are getting worse even as your CLIP scores improve.