Search papers, labs, and topics across Lattice.
3
0
7
7
Today's best AI agents can only complete 33% of common online tasks like booking appointments or filling out job applications, revealing a significant gap between current capabilities and real-world utility.
Image generation models ace photorealistic art but still choke on screenshots and infographics, highlighting a critical gap in real-world applicability.
MLLMs may ace your visual question answering, but VisPhyWorld reveals they're still struggling to actually *simulate* physics.