University of California, Berkeley
Evolving coding problems can restore meaningful evaluation metrics for frontier models, revealing their true capabilities and enabling self-improvement.