SearchSwarm shows that effective delegation in LLMs can significantly boost performance on long-horizon tasks, and it achieves state-of-the-art results in complex research scenarios.
Asynchronous RL for LLMs doesn't have to sacrifice convergence for speed: DORA achieves 2-4x faster training by cleverly managing multiple policy versions during rollout.