Search papers, labs, and topics across Lattice.
2
0
4
16
Forget static rubrics and expensive external models: EvoRubric co-evolves a single policy to generate both responses and the rubrics to evaluate them, outperforming traditional RLHF methods in open-ended generation tasks.
LLM agents can achieve 3x faster web search and higher accuracy by dynamically routing between multiple context management strategies.