Search papers, labs, and topics across Lattice.
4
0
5
5
Converting noisy, human-centric guides into self-evolving agent skills can yield performance improvements of up to 25.3 percentage points across diverse tasks.
Even the best multimodal models struggle to understand dynamic charts, achieving only 84.5% accuracy on the new ChartAct benchmark that requires interaction to reveal key information.
Coding agents can now evolve their own harnesses to outperform human-designed ones, thanks to a novel observability-driven approach.
OpenMobile proves that high-performing mobile agents can be trained on entirely synthetic, open-source data, closing the gap with closed-source models and enabling broader research.