Search papers, labs, and topics across Lattice.
1
0
4
8
Decoding LLMs with dynamic adapters doesn't have to be 2.5x slower: AdaFuse slashes latency by 2.4x with token-level pre-gating and fused kernel optimization.