Search papers, labs, and topics across Lattice.
5
0
4
By leveraging the complementary strengths of shallow and deep VFM features, Ideal dramatically enhances image reconstruction quality and sets new benchmarks in autoregressive image generation.
Reinforcement learning boosts multimodal performance, raising task scores and creating unexpected synergies between image generation and editing.
OmniGen-AR can seamlessly generate images from a wide array of conditions, outperforming existing methods that are limited to single-modality inputs.
ActiveMimic reveals that leveraging human egocentric video with active perception can bridge the performance gap with robot-pretrained models.
BiDPO achieves a remarkable boost in compositional fidelity for text-to-image generation, outperforming previous methods through innovative preference optimization techniques.