Search papers, labs, and topics across Lattice.
2
0
4
4
Forget task-specific overfitting: training coding agents on atomic skills unlocks surprisingly broad generalization to complex software engineering tasks.
LLMs struggle to translate code into formal specifications, as evidenced by their poor performance on the new Model-Bench benchmark, revealing a critical gap in their ability to support formal verification.