Search papers, labs, and topics across Lattice.
2
0
4
Fine-tuning on the DeNovoSWE dataset boosts long-horizon software engineering performance by over 40 percentage points, revealing the potential of LLMs in complete repository generation.
Autonomous ML research agents achieve significantly better long-horizon performance by maintaining durable state through a shared workspace, suggesting that orchestration and memory are more critical than raw reasoning power.