Search papers, labs, and topics across Lattice.
2
0
6
Rewarding *correct* answers in multimodal reasoning can actually *worsen* reasoning quality, but a simple groupwise ranking of solution trajectories significantly boosts reliability.
Community-sourced tools aren't just more diverse, they can substantially boost agent performance, suggesting that the quality of the tools themselves, not just how agents use them, is a major bottleneck.