Search papers, labs, and topics across Lattice.
IDEA Research
2
0
5
Bayesian-Agent transforms how LLM agents evolve skills, achieving up to 100% success on complex benchmarks through a novel posterior-guided optimization approach.
NL2SQL evaluation has a new champion: ROSE, an intent-centered metric that aligns 24% better with human judgment than existing metrics.