Search papers, labs, and topics across Lattice.
National Key Laboratory for Novel Software Technology, Nanjing University, China, School of Artificial Intelligence, Nanjing University, China
1
0
2
3
Stop blindly trusting LLM-generated tests: ACES reveals which tests actually distinguish good code from bad, leading to better code evaluation without needing ground truth labels.