Forget static rubrics and expensive external models: EvoRubric co-evolves a single policy to generate both responses and the rubrics to evaluate them, outperforming traditional RLHF methods in open-ended generation tasks.
AgentDoG 1.5 proves you can achieve GPT-5.4-level agent safety with open-source models trained on just 1k samples, slashing deployment overhead by two orders of magnitude.
AgentSchool offers a powerful new way to simulate educational environments, moving beyond simple role-play to model learning as a dynamic state transition and providing a testbed for long-horizon memory and multi-agent coordination.