Meta Superintelligence Labs - FAIR, Université Paris-Dauphine PSL
Extrapolating between code-generating RL agents trained on different unit test coverages unlocks better correctness-efficiency trade-offs than any single agent alone.