Search papers, labs, and topics across Lattice.
mm-webagent
2
0
5
Hierarchical planning and self-reflection can finally wrangle AIGC tools into producing coherent, visually consistent webpages.
Today's best text-to-audio-video models may look and sound impressive, but they still struggle with basic physics, coherent speech, and even rendering text correctly.