Search papers, labs, and topics across Lattice.
1
0
3
2
Training speculative decoding models just got an order of magnitude faster, unlocking real-world deployment with a new open-source framework and a suite of production-ready draft models.