Search papers, labs, and topics across Lattice.
1
0
2
Speculative decoding's speed boost just got a whole lot bigger: DIVERSED dynamically loosens the verification constraints, letting more good tokens through and accelerating inference.