Search papers, labs, and topics across Lattice.
2
0
4
4
Forget exotic attention mechanisms – MobileLLM-Flash achieves up to 1.8x faster LLM prefill on mobile CPUs by smartly pruning and adapting existing architectures for on-device use.
Forget handcrafted kernels: Empirical GPs learn flexible, data-driven priors directly from historical data, unlocking richer covariance structures.