Squeeze more out of your hardware: TSP lets you shard both weights and activations across the same devices, unlocking memory savings for long-context training and inference.