Search papers, labs, and topics across Lattice.
2
0
5
3
LLMs can guide dense retrieval far more effectively by actively exploring the embedding space with Gaussian Processes, outperforming standard reranking even with a limited LLM budget.
Jointly training LLMs to reason and refine their answers unlocks significant performance gains, outperforming standard policy optimization by up to 11.5 points on AIME.