Search papers, labs, and topics across Lattice.
Sun Yat-sen University
2
0
4
LLMs aren't always needed: CelerLog shows you can get SOTA log parsing with a hybrid approach that's up to 18x faster and cuts token costs by 94%.
Forget slow attention: FlashPrefill achieves a staggering 27x speedup in long-context prefilling by instantly discovering and thresholding sparse attention patterns.