Rescuing previously rejected tokens that are semantically valid but lexically different speeds up speculative decoding by more than 2x without sacrificing accuracy.
Unlock the full potential of your GPU's Tensor Cores for 3D Gaussian Splatting with a GEMM-friendly blending transformation that delivers up to 2x speedups.