Introduction With the advancement of multimodal large language models (MLLMs) and coding agents [Claude, Code [si-etal-2025-design2code] and Web
Today's best multimodal agents still fall into "blind execution" traps when building websites from ambiguous user requests, revealing critical gaps in intent recognition and adaptive interaction.