Search papers, labs, and topics across Lattice.
University of Illinois Chicago
2
0
3
3
LLM agents can now autonomously generate complex skills with multi-file dependencies, rivaling human-authored skills, thanks to a co-evolutionary verification process that doesn't need ground truth labels.
Even state-of-the-art LLMs struggle to adapt to mid-task changes in long-horizon web navigation, highlighting a critical gap in their ability to handle realistic user interactions.