Decoding LLMs with dynamic adapters doesn't have to be 2.5x slower: AdaFuse slashes latency by 2.4x with token-level pre-gating and fused kernel optimization.
GUI agents struggle with semantically ambiguous actions, but HATS tackles this by iteratively exploring hard cases and refining instruction alignment, leading to significant performance gains.