Search papers, labs, and topics across Lattice.
3
0
6
19
Hierarchical planning and self-reflection can finally wrangle AIGC tools into producing coherent, visually consistent webpages.
Today's best text-to-audio-video models may look and sound impressive, but they still struggle with basic physics, coherent speech, and even rendering text correctly.
Current image generation models fall far short of the mark when it comes to the structured and multi-constraint demands of real-world commercial design, as revealed by a new systematic benchmark.