Search papers, labs, and topics across Lattice.
1
0
2
Speculative decoding gets a throughput boost of up to 4.32x by using reinforcement learning to dynamically balance drafting and verification.