Search papers, labs, and topics across Lattice.
3
0
7
Don't let valuable steps in failed trajectories go unnoticed: GraphGPO leverages state-transition graphs for fine-grained credit assignment in agentic RL, boosting performance and efficiency.
Cycle-consistent learning unlocks self-improvement in vision-language models, enabling them to reason about their own generations and boosting performance across understanding and generation tasks.
A new family of GUI agents, GUI-Owl-1.5, leapfrogs existing open-source models on 20+ GUI benchmarks, proving that multi-platform, real-time GUI automation is now within reach.