Search papers, labs, and topics across Lattice.
2
0
5
0
Save up to 2.79x on LLM serving costs by intelligently distributing models across a diverse fleet of cloud GPUs.
Tensor program optimization just got a whole lot faster: Prism achieves up to 2.2x speedup over existing superoptimizers while *also* reducing end-to-end optimization time.