2 papers published across 1 lab.
LLMs can now automatically slim down and future-proof mathematical proofs by strategically rewriting them, achieving 70% compression and 60% faster compilation.
Forget expensive human annotations: this unsupervised method trains reward models that steer LLM reasoning just as well as, or even better than, their supervised counterparts.