Search papers, labs, and topics across Lattice.
2
1
5
Today's best AI agents still fail more than half the time on real-world tasks combining vision, search, and coding, revealing critical gaps in reasoning and tool use.
Open-source web agents can now outperform GPT-4o on key web navigation tasks, thanks to a new dataset and model family that levels the playing field.