Search papers, labs, and topics across Lattice.
3
0
5
Even the best LLMs struggle with Olympiad-level combinatorics, achieving only 65.4% on a benchmark designed to expose their reasoning limitations.
Draft-OPD accelerates inference by over 5x while improving speculative decoding accuracy, transforming how draft models learn from target feedback.
OPD's "free lunch" of dense token-level reward may be an illusion, as teacher novelty, not just higher scores, drives successful distillation.