Search papers, labs, and topics across Lattice.
1
0
3
Agentic search gets a meta-RL upgrade: MR-Search learns to reflect on past failures, leading to 9-19% improvements over standard RL baselines across diverse benchmarks.