Search papers, labs, and topics across Lattice.
2
0
4
Stop hand-engineering your multi-agent LLM systems: UnityMAS-O lets you train them end-to-end with RL, unlocking surprisingly large gains, especially for smaller models.
Stop relying on absolute LLM scores for RLHF: relative comparisons via tournaments yield significantly better rewards for long-form generation.