Tencent
RL agents can learn to write stronger formal specifications by using automatically generated negative tests as a reward signal for completeness, outperforming standard verification-based rewards.
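
To make the idea of a completeness reward concrete, here is a minimal sketch, not the paper's implementation. It assumes a specification can be modeled as a predicate over (input, output) pairs, that "negative tests" are input/output pairs showing incorrect behavior, and that the reward is the fraction of negative tests the specification rejects, gated on the specification accepting all known-correct pairs. The names `completeness_reward`, `weak_spec`, and `strong_spec`, and the absolute-value example, are all hypothetical.

```python
from typing import Callable, Iterable, Tuple

# Assumption: a spec is modeled as a predicate over (input, output) pairs.
Spec = Callable[[int, int], bool]
Case = Tuple[int, int]


def completeness_reward(
    spec: Spec,
    positive_tests: Iterable[Case],
    negative_tests: Iterable[Case],
) -> float:
    """Hypothetical reward: fraction of negative tests the spec rejects.

    Returns 0.0 if the spec rejects any known-correct pair, so a spec
    cannot score well simply by rejecting everything.
    """
    positives = list(positive_tests)
    negatives = list(negative_tests)

    # Soundness gate: every correct behavior must satisfy the spec.
    if not all(spec(x, y) for x, y in positives):
        return 0.0
    if not negatives:
        return 0.0

    # Completeness: count incorrect behaviors the spec rules out.
    rejected = sum(1 for x, y in negatives if not spec(x, y))
    return rejected / len(negatives)


# Illustrative target: a function that should return abs(x).
def weak_spec(x: int, y: int) -> bool:
    # Too permissive: only requires a non-negative result.
    return y >= 0


def strong_spec(x: int, y: int) -> bool:
    # Pins the result down to the magnitude of x.
    return y >= 0 and (y == x or y == -x)


POSITIVE = [(3, 3), (-4, 4), (0, 0)]            # correct behaviors
NEGATIVE = [(3, 4), (-4, 3), (2, -2), (0, 1)]   # incorrect behaviors

if __name__ == "__main__":
    print(completeness_reward(weak_spec, POSITIVE, NEGATIVE))
    print(completeness_reward(strong_spec, POSITIVE, NEGATIVE))
```

In this toy setup both specifications accept every correct pair, so a reward based only on that check could not tell them apart. The negative tests separate them because the weaker specification lets most incorrect outputs through.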