University of Chinese Academy of Sciences (UCAS); New Laboratory of Pattern Recognition (NLPR), CASIA; State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA
Existing GUI agents can parrot actions, but AutoGUI-v2 reveals they still lack a deep understanding of GUI functionality and struggle to predict the outcomes of even simple interactions.
You don't need billions of parameters to accurately ground GUI elements: GoClick, a 230M-parameter model, matches the performance of much larger models, opening the door to on-device GUI agents.