Diffusion Transformer (DiT) activations are far more amenable to semi-structured sparsity than DiT weights, which unlocks significant inference speedups without sacrificing generation quality.
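To make "semi-structured sparsity" concrete, here is a minimal sketch of N:M magnitude pruning applied to a vector of activations. It assumes the common 2:4 pattern (keep the 2 largest-magnitude values in every group of 4), which the summary does not specify. The function name `nm_sparsify` and the sample values are hypothetical illustrations, not the paper's method or data.

```python
from typing import List


def nm_sparsify(values: List[float], n: int = 2, m: int = 4) -> List[float]:
    """Keep the n largest-magnitude entries in each consecutive group of m.

    Assumption: a plain magnitude criterion; the paper may select entries differently.
    """
    if len(values) % m != 0:
        raise ValueError("length must be a multiple of m")
    out: List[float] = []
    for start in range(0, len(values), m):
        group = values[start:start + m]
        # Indices of the n largest-magnitude entries in this group.
        ranked = sorted(range(m), key=lambda i: abs(group[i]), reverse=True)
        keep = set(ranked[:n])
        # Zero every entry that was not selected.
        out.extend(v if i in keep else 0.0 for i, v in enumerate(group))
    return out


# Hypothetical activation values, two groups of four.
activations = [0.9, -0.1, 0.05, -1.2, 0.3, 0.0, -0.4, 0.2]
sparse = nm_sparsify(activations)
print(sparse)
```

Every group of four ends up with at most two nonzero entries. That fixed pattern is what lets hardware with sparse matrix support skip the zeroed positions, whereas unstructured sparsity gives no such guarantee.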