Search papers, labs, and topics across Lattice.
UC San Diego 2 Zhejiang University
1
0
2
JetFlow breaks the speculative decoding speed ceiling, achieving up to 9.64x faster performance on complex tasks by aligning candidate tree generation with autoregressive models.