Stop blindly trusting LLM-generated tests: ACES reveals which tests actually distinguish good code from bad, improving code evaluation without requiring ground-truth labels.