Search papers, labs, and topics across Lattice.
PolyU
2
0
3
2
GUI agents struggle in dynamic environments because they only see static screenshots, but DynamicUI's video-based approach with frame selection and action-conditioned refinement leaps ahead.
Existing GUI agents can parrot actions, but AutoGUI-v2 reveals they still lack a deep understanding of GUI functionality and struggle to predict the outcomes of even simple interactions.