Forget slow attention: FlashPrefill achieves a staggering 27x speedup in long-context prefilling by instantly discovering and thresholding sparse attention patterns.