Search papers, labs, and topics across Lattice.
2
0
5
15
Latent reasoning can now outperform explicit reasoning in complex tasks, thanks to a new RL method that stabilizes training by explicitly handling issues like invalid latent states and misaligned token-level updates.
Forget human clicks: training retrieval models directly from agent behavior unlocks significant gains in task success and efficiency for LLM-powered search agents.