Search papers, labs, and topics across Lattice.
3
0
5
13
Generative videos might look great, but a new metric reveals they often suffer from jarring 3D spatial inconsistencies that existing metrics miss.
DriveTok achieves unified multi-view reconstruction and understanding by learning scene tokens that integrate semantic, geometric, and textural information, outperforming existing 2D tokenizers in autonomous driving scenarios.
WaterVIB achieves superior zero-shot watermark robustness against generative AI attacks by learning a minimal sufficient representation, sidestepping the fragility of existing methods that entangle watermarks with easily-rewritten high-frequency cover textures.