Heterogeneous agents can boost each other's performance in RL without coordinated deployment, achieving better results with less data than traditional methods.
Skip the expert-authored rubrics: AI feedback driven by learning progressions can be just as good.