Forget prompt engineering: this new region proposal network spots objects across diverse datasets without *any* text or image prompts.
Forget billion-scale datasets: EvoTok achieves state-of-the-art image tokenization for both understanding and generation using a residual evolution process trained on just 13M images.
Current image editing models stumble when domain-specific knowledge is required, as revealed by a new benchmark spanning disciplines from natural science to social science.
DreamWorld achieves more world-consistent video generation by jointly modeling multiple heterogeneous dimensions of world knowledge, moving beyond surface-level plausibility.