Nanjing University
JetFlow breaks the speed ceiling of speculative decoding, achieving up to a 9.64x speedup on complex tasks by aligning candidate-tree generation with autoregressive models.