Cut your prompt length by 80% without sacrificing accuracy, using a diffusion-based pruning method that can mask multiple tokens at once.